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Preface 



“Foundations of the Formal Sciences” (FotFS) is a series of interdis- 
eiplinary eonferenees in mathematies, philosophy, eomputer seienee and 
linguisties. The main goal is to reestablish the traditionally strong links 
between these areas of researeh that have been lost in the past deeades. 

The first two eonferenees of the series took plaee in Berlin (May 
1999, Foundations of the Formal Sciences 7, organized by Benedikt Lowe 
and Florian Rudolph) and Bonn (November 2000, Foundations of the 
Formal Sciences II, organized by Peter Koepke, Benedikt Lowe, Wolf- 
gang Malzkorn, Thoralf Raseh, and Rainer Stuhlmann-Laeisz). Proeeed- 
ings volumes have appeared as a double speeial issue of the journal Syn- 
these [LowRudi02] and as a volume in this book series [LowMalRasOS]. 

In February 2003, we had the fourth eonferenee of the series FotFS IV 
on “The History of the Coneept of the Formal Seienees” in Bonn, and the 
fifth eonferenee FotFS V on “Infinite Games” is already in preparation 
for Fall 2004 as a joint aetivity of the Institute for Logic, Language and 
Computation in Amsterdam and the Mathematisches Institut in Bonn. 

The eonferenee Foundations of the Formal Sciences III was a PhD 

EuroConference in the European Community program “Improving Hu- 
man Researeh Potential and the Soeio-Eeonomie Knowledge Base” fun- 
ded under the eontraet number HPCF-2000-00040. It was also a satellite 
meeting of the joint meeting of the Deutsche Mathematikervereinigung 
(DMV) and the Osterreichische Mathematische Gesellschaft (OMG), the 
15. OMG-KongreB. 

The eonferenee EotES III took plaee in the Institut fUr Formale Logik 
of the Universitdt Wien loeated in the Josephinum. The Josephinum was 
founded in 1785 by Emperor Joseph II. as an aeademy for military doe- 
tors and still houses Vienna’s premier medieal museum (the Museum des 
Instituts fur Geschichte der Medizin) with a famous eolleetion of wax 
anatomieal figures. It was depleted on the baek of the old Austrian 50- 
Sehilling bill before the Euro was introdueed on January 1st, 2002 (see 
Eigure 1). The partieipants of the eonferenee enjoyed the blend of the his- 
torieal loeation elose to the eentre of Vienna and the modem equipment 
of the logie institute. 




PREFACE 



viii 




Fig. 1. The Josephinum on the 50-Schilling Bill 



Holding a conference on the Formal Sciences {Formalwissenschaf- 
ten) in Vienna is particularly fitting. The Formal Sciences (mathematics, 
computer science, formal philosophy and theoretical linguistics) have 
found their niches in the university cultures, mostly determined by the 
history of the subjects. In Germany, mathematics and computer science 
are often grouped with the Natural Sciences in a Mathematisch-Natur- 
wissenschaftliche Fakultdt where the explicit mention of mathematics 
seems to hint at its special nature distinct from the natural sciences. In 
the American system, mathematics can be both part of the Arts and part 
of the Sciences, whereas computer science is often associated with Engi- 
neering. 

One of the most prominent members of the Vienna Circle, Rudolf 
Carnap, stressed the special nature of the Formal Sciences [Cari35], and 
it seems that some relicts of this special status are still preserved in the 
Austrian university system: 
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IX 



Until recently, the Universitat Wien had a Natur- und Formalwis- 
senschaftliche Fakultdt (the Institut fUr Mathematik and the Institut fiir 
Formale Logik representing the Formal Sciences part) until it was re- 
named Fakultdt fiir Naturwissenschaften und Mathematik. The Wirt- 
schaftsuniversitdt Wien still has a Fachbereich fiir Sozial-, Geistes- und 
Formalwissenschaften, and in general, you will encounter the term For- 
malwissenschaften surprisingly often in the Austrian texts and discus- 
sions about academia and the classification of sciences, at least much 
more often than in Germany, the United States and other countries. 

During the Opening Ceremony of FotFS III, the Universitat Wien 
was represented by Marianne Popp, the Dean of the Faculty of Nat- 
ural Sciences and Mathematics. The participants were also greeted in 
the name of the Institut fur Mathematik of the Universitat Wien and the 
Osterreichische Mathematische Gesellschaft (OMG) by Karl Sigmund. 
In his role as the president of the OMG, Professor Sigmund was the chair 
of the organizing committee of the 15th OMG-KongreB. 

We would like to thank all speakers at the conference for contribut- 
ing to the success of the conference. In addition to that, we would like to 
thank the Institut fUr Formale Logik at the Universitat Wien, and in par- 
ticular, its director, Sy D. Friedman, for giving us the opportunity to have 
the conference in Vienna and for the institutional support and access to 
the institute infrastructure. We also would like to thank the Mathemati- 
sches Institut of the Rheinische Friedrich-Wilhelms-Universitat Bonn for 
the support during the preparation phase. Of course, a conference can’t be 
organized without a lot of help. We extend our thanks to all helpers at this 
conference and name them in alphabetic order: Gudrun Fankhaenel, An- 
dreas Metzler, Mag. Pavel Ondrejovic, Dipl. -Math. Philipp Rohde, David 
Schrittesser, Margit Svitil, Lie. Marc van Eijmeren. 

We’d also like to thank the series editor, Heinrich Wansing (Dresden), 
and Jolanda Voogd and Charles Erkelens at Kluwer Academic Publishers 
for their support during the planning and preparation of the proceedings 
volume. The ETgX-classfile fotfs.cls which was used for the lay- 
out of this volume is due to Dipl. -Math. Philipp Rohde (Aachen). We 
received technical assistance from Dipl.-Math. Stefan Bold (Bonn). 




Last but not least, we have to thank the numerous referees who helped 
us a lot to produce this volume by producing illuminating and competent 
reports. 



AmsterdamAVien/Potsdam, March 2003 



B.L. B.P Th.R. 
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Abstract. This paper is an introduction to the entire volume: the notions of re- 
duction functions and their derived complexity classes are introduced abstractly 
and connected to the areas covered by this volume. 



1 Introduction 

Logic is famous for what Hofstadter calls limitative theorems: Godel’s 
incompleteness theorems, Tarski’s result on the undefinability of truth, 
and Turing’s proof of the non-computability of the halting problem. For 
each of these limitative results we have three ingredients: 

1. an informal notion to be investigated (e.g., provability, expressibility, 
computability), 

2. a formalization of that notion (e.g . , provability as formalized in Peano 
arithmetic PA, definability in a formal language, the Turing com- 
putable functions). 
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3. and finally a theorem that there is some limitation to the formalized 
version of the informal notion {e.g., the theorems of Godel, Tarski 
and Turing). 

The limitative theorems split the world into cis and trans relative to 
a barrier and show that some objects are beyond the barrier {trans). In 
our examples, the statement “PA is consistent” Cons(PA) is beyond the 
barrier marked by the formal notion of provability in PA, a truth predi- 
cate satisfying the convention (T) is beyond the barrier of definability, 
and the halting problem is "'beyond the Turing limit” (Siegelmann). Even 
more importantly, these results relativize: the notion of formal provabil- 
ity in the stronger system PA -f Cons(PA) gives us another barrier, but 
for this barrier Cons(PA) lies cis, and Cons(PA -f Cons(PA)) lies trans. 
This calls for iteration, and instead of seeing limitative theorems mainly 
as obstacles, we can see their barriers positively, as a means of defining 
relative complexity hierarchies.^ This is the approach we chose for this 
introductory paper. 

Let us illustrate the problems and methodology of limitative results 
in some examples: 

(Example 1) Is there an effective algorithm to determine whether 
a number is prime ? 

The informal notion involved in this question is the notion of “ef- 
fective algorithm”. In order to give a positive answer to the question in 
(Example 1), you do not necessarily need a metatheoretical apparatus 
or even a proper formal definition of what an effective algorithm is. You 
have to display an algorithm, and convince people that it is effective. 

But this pragmatic approach is not possible if you want to answer 
such a question in the negative. Consider the following statement: 

(Example 2) There is no effective algorithm that determines 
whether a given graph has an independent subset of size k.^ 

* In a much more general approach. Wolfram Hogrebe writes in his preface to the proceedings 
volume of the XIX. Deutscher Kongrefi fur Philosophie that was held in Bonn in September 
2002 with the general topic Grenzen und Grenziiberschreitungen: 

“Die limitativen Aspekte der Condition Humaine ... [sind] nie nur Beschrankungen, son- 
dern immer auch Bedingungen der Konturfahigkeit im Ausdrucksraum.” [Hogi02] 

^ An independent subset of a graph is a set A such that for a,b € A, there is no edge between 
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The statement of (Example 2) is a universally quantified statement, 
so in order to prove it, you have to show for each effective algorithm that 
it doesn’t do the job. This requires a formalization of the notion of an 
“effective algorithm”, the development of a metatheory of algorithms. 

The usual formalization of “effective algorithm” in computer science 
is “polynomial time algorithm”.^ Using that formalization, Agrawal, Sax- 
ena, and Kayal have answered the question in (Example 1) positively in 
their already famous [AgrKaySaxcx)]: they give an 0(log^'^ n)-time al- 
gorithm for determining whether a given number is prime. Looking at 
(Example 2), the question of whether a given graph has an indepen- 
dent subset of size k is NP-complete; cf. [Pap94, Theorem 9.4]. Con- 
sequently, the validity of the statement of (Example 2) is equivalent to 
P ^ NP. In light of this, a proof of P 7 ^ NP would be another limita- 
tive theorem, locating all problems solvable in polynomial time cis and 
the NP-complete problems like TSP, SAT and the problem mentioned 
in Example 2 trans of the barrier determined by our formalization of 
“effective algorithm”."^ 

Let us move from the examples from computer science to an example 
from pure mathematics: In classical geometry, you are interested in con- 
structions with ruler and compass: what lengths can be constructed from 
a given unit length by drawing auxiliary lines with ruler and compass. 
E.g., can we give a geometric construction of \/2, of of tt? 

Again, you can prove existential statements of the form “There is a 
ruler-and-compass construction of X” by just displaying the construc- 
tion, e.g., for v^. 

On the other hand, there is no way to prove a statement of the form 
“X cannot be constructed with ruler and compass” without a means of 
proving things about arbitrary ruler-and-compass constructions, i.e., of 
talking about ruler-and-compass constructions as mathematical entities. 
In fact, you can do this, and prove non-constructibility (as is taught in al- 
gebra courses) by interpreting ruler-and-compass constructions as a cer- 
tain class of field extensions. This is (a small fragment of) what is called 
Galois theory, and it would deserve the name metageometry. 

^ Cf. the introduction of Engebretsen’s paper in this volume: “It has been widely accepted that 
a running time that can be bounded by a function that is a polynomial in the input length is a 
robust definition of ‘reasonable running time’ (Engebretsen, p. 78)”. 

Here, TSP is the Traveling Salesman Problem [Pap94, § 1.3] and SAT is the Satisfiability 
Problem [Pap94, §4.2]; cf. also Section 6. 
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For classical construction problems like squaring the eirele, doubling 
the eube (the Delie problem), triangulation of arbitrary angles, and eon- 
struetion methods for the regular p-gon, humankind had been seeking un- 
sueeessfully for positive answers {i.e., proofs of the existential formula) 
for eenturies. Mathematieians embedded ruler- and-eompass eonstruetion 
into the rieher theory of field extensions by assoeiating to a given real r a 
field extension and proving that r being ruler-and-eompass eonstruetible 
is equivalent to a property of the assoeiated field extension.^ 

Thus, the universally quantified negative forms of the elassieal prob- 
lems mentioned above are transformed into simple statements about alge- 
braie objeets: the eube eannot be doubled by ruler and eompass because 

has degree 3 over Q; the angle of 60° eannot be triangulated, sinee the 
polynomial — |X — | is the minimal polynomial of cos(20°) over Q; 
the regular 7-gon ean not be eonstrueted sinee the 7th eyelotomie poly- 
nomial has degree 6 over Q and is the minimal polynomial of a 7th root 
of unity. 

In the eases diseussed, the limitative results relativize: similar to the 
relativization of Gddel’s ineompleteness theorem for Peano Arithmetie 
PA, we get notions of relative eomputability and relative ruler-and-eom- 
pass eonstruetibility. These give rise to problems that are not even eom- 
putable if we have the halting problem as an oraele, or numbers that are 
not ruler-and-eompass eonstruetible even if the ruler has ^ marked on 
it. As soon as a limitative theorem has told us that there are things on 
the other side of the barrier, we are not eontent anymore with the simple 
diehotomy of cis and trans. We want to eompare those that are beyond 
the barrier aeeording to their eomplexity. This leads to questions of the 
following type: 



(Example 3) Is it harder to determine whether a II 2 expression is 
satisfiable than to determine whether a quantifier free expression 
is satisfiable? 

The first problem is known as Schonfinkel-Bernays SAT, the latter 
is SAT itself. Both are NP-hard by Cook’s Theorem [Pap94, Theorem 



^ C/, e.g., [Lan93, Chapter VI], 
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8.2]. Are they equally hard, or is the first one harder than the second 
one?^ 

In order to talk about questions like this, we need a relation “is harder 
than” or “is at least as hard as” and a corresponding complexity hierar- 
chy. In this paper, we shall restrict our attention to a special class of 
complexity hierarchies, viz. those induced by reduction functions. This 
choice is motivated by the fact that the hierarchies investigated in com- 
puter science are of this type, and some of the most famous hierarchies 
in mathematical logic {e.g., the Wadge hierarchy, one-reducibility and 
many-one-reducibility) are as well. We shall introduce a notion of com- 
plexity hierarchy in an abstract way in Section 2 and then specialize in 
the sections to follow. 



Before starting with our abstract account, we should add a disclaimer: 
this paper is not a survey of notions of complexity and definitely not a 
history of complexity. Its purpose is to describe a general feature of sev- 
eral complexity notions that occur in mathematics and computer science 
and use that to tie together the papers in this volume. We shall give some 
pointers to the literature for the interested reader but there is no intention 
to be comprehensive. 

2 Abstract notions 

Let K be some basic set of objects. This can be the set of graph structures 
on a given set, of natural numbers, of real numbers etc. Suppose that we 
fixed some set F of functions from K to K that is closed under com- 
positions (i.e., if / and g are from F, then f o g is from F as well) and 
contains the identity function. We will call these functions reductions. 
Depending on the context, we can choose different sets F. 

We can now use F to define a partial preorder (i.e., a reflexive and 
transitive relation) on the power set of K, p{K): 

A<fB:^ 3feF{A = f-^[B]). 

The notion of a reduction and the derived partial preorder occur in 
many areas of mathematics and computer science: the “efficient reduc- 

® Schonfinkel-Bernays SAT is NEXP-complete [Pap94, Theorem 20.3] and SAT is in NP. 

Hence, the answer to the question in (Example 3) is ‘Yes’. 
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tions” of theoretical computer science^, the continuous and Borel reduc- 
tions of descriptive set theory*, the computable reductions from recur- 
sion theory^, and also the Rudin-Keisler ordering of the theory of ultra- 
filters. In this paper, we will only talk about comparing sets of objects 
of the same type. If you want to compare problems from different areas, 
the corresponding asymmetric version of the reductions mentioned here 
are the Galois- Tukey reductions or Galois correspondences. 

Given our reducibility relation <p defined from F, we can now de- 
fine the corresponding equivalence relation 

A =p B : < > A B A A B, 

and then look at the set of equivalence classes Cp := p{K)/=p. The 
reducibility relation <p transfers directly to a relation <i? on the equiv- 
alence classes where it is a partial order. We denote the equivalence 
class of a set A by [^]=p. The elements of can now be called the 
F-complexity classes or F-degrees. 

In the following, we shall give examples of K and F and the derived 
notions of complexity classes. 

3 Sets of Reals 

In descriptive set theory, the set of objects K is the set of real numbers 
M. This set is naturally endowed with the canonical topology. 

Traditionally, the topological space M is endowed with several com- 
plexity hierarchies of sets of real numbers that -at least prima facie- are 
not derived from complexity functions: the Borel hierarchy and the pro- 
jective hierarchy. 

For the Borel hierarchy, we set to be the set of all open sets. For 
any ordinal a, we set 11° to be the collection of complements of sets in 
S° (so n? is the set of closed sets) and S° is the collection of all sets that 
are countable unions IJieN where each Ai is in some 11° . for a* < a. 

’’ Cf. [Pap94, Definition 8.1] and Section 6. 

* Cf. Sections 3 and 4. 

^ Cf. Section 5. 

Note that the Rudin-Keisler ordering is of slightly different type because ultrafilters are sets 
of sets of numbers: for ultrafilters U and V on N, we say that U is Rudin-Keisler reducible 
to V {U <RK V) if there is a function / : N ^ N such that U := {f~^[X] X G V}. 




COMPLEXITY HIERARCHIES DERIVED FROM REDUCTION FUNCTIONS 



7 



The index a indicates how often you have to iterate the operations 
“complementation” and “countable union” to get a set which is in . 
Thus, in a preconceived notion of complexity, we could call 

CBorei(^) := minjo ; A G U 11° } 

a complexity measure for Borel sets and can derive a complexity relation 

^ ^Borel • '' '' CBorel(^) ^ CBorel(-S) 



from it. 

Similarly, we can look at the projective hierarchy (defined on finite 
Cartesian products of M). We let Sj be the set of Borel sets, and for each 
n, we let 11;'^ be the collection of complements of sets in Then 
is defined to be the set of projections of sets in 11,'^, where A C is a 
projection of i? C if 

X & A 3y{{y,x) G B). 

Again, if we set cproj(A) to be the least n such that A G U n,'^, 
we can call the following relation a complexity relation: 

A ^Proj B . \ L Cproj(A) ^ Cproj(5). 

These two hierarchies are proper hierarchies, i.e., for a < loi and 
1 < n < LO, the following inclusions are all proper:^' 



> II A 



1 

n+1' 



By the technique used to prove this chain of strict inclusions, it is con- 
nected to the limitative theorems of logic: As in the limitative theorems, 
the non-equality of the complexity classes (often called a hierarchy the- 
orem) is proved using the method of diagonalization via universal sets. 

Even more connections to logic emerge: if you look at the well- 
known Levy hierarchy of formulas where you count alternations of 
quantifiers^^, then sets definable with a real number parameter and first- 
order quantifiers over the standard model of (second-order) arithmetic 



“ Here, A° — n and A), — Si n HE 
A formula with n alternations of quantifiers starting with 3 is called a formula, a formula 
with n alternations starting with V is called H„+i. The Levy hierarchy was introduced in 
[Lev65]. 
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with a (n„) formula are exactly the (11° ) sets, and those definable 
with a real number parameter and both first- and second-order quantifiers 
over the standard model of (second-order) arithmetic with a (II„) for- 
mula are exactly the (H^) sets. 

Thus, Borel and projective complexity fit well into the concept of 
formula (Levy) complexity. 

These complexity relations are related to reduction functions in the 
following way: Let Fw be the set of continuous functions from M to M. 
Then the relation <p^ defined on p(M) is called Wadge reducibility 
<w- If you take a set A G S°\II°, then (using Wadge’s Lemma and 
Borel Determinacy) 



Similarly, (under additional assumptions) we get a description of the pro- 
jective classes as initial segments of the Wadge hierarchy: <w is a refine- 
ment of both < Borel and <Proj- 

The Wadge hierarchy is one of the most fundamental hierarchies in 
the foundations of mathematics. Under the assumption of determinacy 
of games, the Wadge hierarchy is an almost linear and well-founded 
backbone of the class of sets of real numbers. Since the theory of real 
numbers and sets of reals is intricately connected to questions about the 
axiomatic framework of mathematics as a whole, the Wadge hierarchy 
serves as a stratification of an important part of the logical strength of 
set-theoretic axiom systems for mathematics. 

For a basic introduction into the theory of Wadge degrees, we refer 
the reader to Van Wesep’s basic paper [Van78]. 

4 Equivalence Classes 

Closely connected to the Wadge hierarchy is the area called Descriptive 
Set Theory of Borel Equivalence Relations. Riccardo Camerlo’s paper 
“Classification Problems in Algebra and Topology” in this volume gives 
an overview of this area of research. 

Take two equivalence relations E and F on the set of real numbers M. 
As relations, they are subsets of M x M. A Borel function / : M — M gives 

By results of Wadge, Martin, Monk, Steel and van Wesep. 
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rise to a Borel function / :MxM— i^MxMby f{x, y) := (/(t), f{y)). 
Look at the class -FBorei of functions like this and call <FBorei re- 
ducibility <B- Note that this means that E <b F if and only if there is a 
Borel function / : M — M such that 

xEy ^ f{x)Ff{y). 

This hierarchy (as opposed to the Borel fragment of the Wadge hier- 
archy) is not at all linear: Camerlo mentions the Adams-Kechris theorem 
on p. 74 of his paper in this volume; you can embed the Borel sets (par- 
tially ordered by inclusion) into this hierarchy. 

Equivalence relations on the real numbers (or on other Polish spaces) 
play a role in classification projects. We will give an example how com- 
plexity enters the discussion here (the example is from Simon Thomas’ 
survey article [ThooOl]): 

In 1937, Reinhold Baer had given a (simple) complete invariant for 
additive subgroups of Q, thus classifying these groups up to isomor- 
phism.^"^ Kurosh and Mal’cev gave complete invariants for additive sub- 
groups of Q”, but “the associated equivalence relation is so complicated 
that the problem of deciding whether two [invariants are the same] ... 
is as difficult as that of deciding whether the corresponding groups are 
isomorphic. It is natural to ask whether the classification problem for 
[additive subgroups of Q*^] ... is genuinely more difficult when n > 2. 
[Thoo01,p. 330]”. 

This problem has been solved by Hjorth [Hjo99] by showing that the 
isomorphism equivalence relation for subgroups of has strictly higher 
complexity than the isomorphism equivalence relation for subgroups of 
Q and in fact, that it is strictly more complicated than Eq, the first non- 
smooth Borel equivalence relation (cf. Camerlo’s Theorem 2.2).^^ 

This complexity result can be seen as a metatheoretical explanation 
for the fact that Kurosh and Mal’cev couldn’t come up with better invari- 
ants for these classes of groups. 

Subgroups of Q can be seen as subsets of N via a bijection between Q and N. Via characteristic 
functions, subsets of N can be interpreted as infinite 0-1-sequences, and those can be seen as 
a real number {e.g., via the binary expansion). Thus, the isomorphism relation on subgroups 
of Q can be investigated as an “equivalence relation on the real numbers”. 

Hjorth comments on the connection between this theorem and certain pretheoretical notions 
of classifiability in his [HjoOO, p. 55, fn. 2]. 
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5 Recursion Theory 

From sets of reals, we move to sets of integers now. In Reeursion Theory, 
K is the set of natural numbers N. We shall look at two different kinds 
of reduction functions: the set Ft of total recursive functions, and the set 
Ft of total injective recursive functions. 

Each of these sets of reductions gives rise to recursion-theoretic re- 
ducibility relations: is known as many-one reducibility <m, and 

<Ft is known as 1-reducibility <i.^^ 

One of the mentioned three classical limitative theorems stems from 
Recursion Theory: Let us assume that we’re looking at computer pro- 
grams in binary code. We can ask whether a program, given its own code 
as an input, will eventually halt or run into an infinite loop. Call the for- 
mer programs halting and the latter looping. Now, is there a program 
that, given any binary code b, can determine whether the program with 
code b is halting or looping? Diagonalization easily shows that there can’t 
be such a program. The set 

H := {6 ; the machine with code b is halting} 

is called Turing’s halting problem: in terms of complexity the diago- 
nalization argument shows that H A for any computable set A (and 
so also H A). 

Hence the derived sets of degrees (Cf*, <m), and {Cpt, <i) are non- 
trivial, and, in fact, as the degrees of Borel equivalence relations, they 
are very far from being linear orders. A huge part of the literature on 
recursion-theoretic degrees investigates the structure of the complexity 
hierarchies of recursion theory. Even more prominently than <i and <m 
features Turing reducibility <t defined by the notion of computation 
in an oracle. The literature on complexity hierarchies in recursion theory 
also includes Sacks’ seminal [SacVl] in which he introduced Sacks forc- 
ing. The properties of Sacks forcing used in recursion theory are similar 
to the minimality property proved in Lemma 2.8 of the survey paper on 
set- theoretic properties of Sacks forcing by Geschke and Quickert in this 
volume. 

Of course, the above definitions are not restricted to the classical no- 
tion of computation. When you change the model of computation and 
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replace “recursive”/“computable” in the above definitions by some pred- 
icate connected to a generalized model of computation, you get new com- 
plexity hierarchies with a lot of structure to investigate. One instance of 
this is the notion of the Infinite Time Turing Machine introduced by 
Hamkins and Kidder. These machines have the same architecture as nor- 
mal Turing machines but can go on through the ordinals in their compu- 
tations. Being computable by an Infinite Time Turing Machine is a much 
more liberal property of functions, so the hierarchies we get are much 
coarser. 

In this volume, there are two papers dealing with Infinite Time Tur- 
ing Machines: a gentle introduction by Joel Hamkins entitled “Supertask 
Computation” and a paper by Philip Welch that discusses supertasks on 
sets of reals, thus connecting Infinite Time recursion theory to descriptive 
set theory (cf. Section 3). 

Having seen the rich structure theory of recursion theoretic hierar- 
chies for the classical notion of computability, one can ask with Hamkins 
(Question 2 of his contribution to this volume): 

What is the structure of infinite time Turing degrees? To what 
extent do its properties mirror or differ from the classical struc- 
ture?^^ 



6 Complexity Theory in Computer Science 

Computable functions, the reductions used in recursion theory, are not 
good enough in theoretical computer science: a computable function in 
general has no bound on computing time or storage space that is used 
while it’s being computed. The fine hierarchies of computer science that 
want to distinguish between problems that are solvable in a realistic time- 
frame and those that require too much time cannot be content with reduc- 
tions of that sort. 

The class of reductions in theoretical computer science is the class 
of functions computable with logarithmic space usage (logspace reduc- 
tions). I.e., there is an algorithm that computes f(x) from x and satisfies 

In his [Welo99], Philip Welch discusses an aspect of Hamkins’ Question 2: minimality in 
the Infinite Time Turing degrees. Again, this is connected to minimality of Sacks reals as 
discussed in the survey of Geschke and Quickert, and Welch uses a variant of Sacks’ argument 
from [Sac71]. 
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the following condition: if the input has n bits, the program will never 
use more than c ■ log(n) bits of storage space during the computation 
(where c is a constant).^* 

If Fi is the class of logspace reductions, then <f<.) is the usual 
world of complexity classes: first of all the deterministic ones, the class 
of polynomial time decidable sets P, those decidable with polynomial 
scratch space usage PSPACE, those decidable in exponential time 
EXP, and also their nondeterministic counterparts NP, coNP, 
NPSPACE, NEXP, and others. The usual inclusion diagram of com- 
plexity classes (cf. [Pap94, §§ 7.2 & 7.3]) can be verified easily: 

P c NP n coNP c PSPACE c EXP C NEXP. 

While there are some non-equality results, one puzzling and titil- 
lating feature of the complexity classes of computer science is that we 
don’t know for sure that all of the inclusions are proper. In particular, we 
are lacking the limitative theorem P ^ NP. For the reader unfamiliar 
with structural complexity theory. Rod Downey’s “Invitation to structural 
complexity” [Dow92] will be a nice way to approach the subject. 

As in recursion theory, also in complexity theory, a new model of 
computation gives rise to new classes of reduction functions, and thus, 
a fortiori, to different hierarchies.^^ Examples are the Blum-Shub-Smale 
theory of computation with real numbers^° or quantum computability as 
discussed in Ambainis’ survey in this volume. 

7 Other Complexity Hierarchies 

We abstractly discussed hierarchies derived in a particular way from re- 
duction functions. Of course, there are more complexity hierarchies in 
logic than that. 

Fundamental are the hierarchies of proof theory that are related to re- 
duction functions in a less direct way: Fix a language C and a collection 

** Cf. [Pap94, Chapter 8]. 

Interesting to note is the connection to Infinite Time Turing Machines: Schindler opened the 
discussion on analogues of the P = NP question for Infinite Time Turing Machines in his 
[Sch2Cx3], Cf. also [HamWeloOS] and [DeoHamSch2Cx3]. 

Cf. [Blu+98]; in Eootnote 4 of his article in this volume, Hamkins mentions that the Blum- 
Shub-Smale theory was the key motivation for the development of Infinite Time Turing Ma- 
chines. 
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$ of /^-sentences. From now on, we mean by an £-theory a primitively 
reeursive set of £-sentenee (understood as an axiom system of the the- 
ory). Let Proof ( pq, vi^V 2 ) be the predieate “pq is the Godel number of a 
proof of the sentenee with the Godel number v\ in the theory with the 
index ^ 2 ”- If S and T are ^-theories, we eall a funetion / a <h-reduction 
of S to T, if it is primitively reeursive and 

PRA h e $ Vn ( Proof (n, ^ S^) Proof (/(n), ) . 

We write S <$ T if such a <h-reduction exists.^^ 

As a famous special case, you can look at = {_L}. Then the ex- 
istence of a -reduction says that every proof of an inconsistency in S 
can be effectively transformed into a proof of an inconsistency in T. In 
other words, if T is consistent, then so is S. This gives rise to the consis- 
tency strength hierarchy <cons-^^ Godel’s second incompleteness the- 
orem states that PA <cons PA -f Cons(PA), so the consistency strength 
hierarchy is nontrivial. Extending Peano arithmetic to second-order arith- 
metic or even set theory gives a multitude of interesting new systems that 
form an increasing hierarchy in the ordering <cons- It is known [Rat99, 
Proposition 2.18] that the consistency strength hierarchy in general is 
not linearly ordered. Yet, surprisingly, for a large class of axiom systems 
(and, importantly, all of the axiom systems considered to be natural are 
among them) the relation has been empirically established to be a lin- 
ear order, actually a well-ordering.^^ As such, it has served as the mea- 
suring rod of logical strength in foundations of mathematics for many 
years and is subtly connected to some of the mentioned other complexity 
hierarchies as the recursion theoretic hierarchies and the Wadge hierar- 
chy. Some subareas of logic have specialized on certain fragments of the 
consistency strength hierarchy, among them are systems of second order 
arithmetic investigated by reverse mathematics, these and other systems 
of proof theory by proof theoretic ordinal analysis, and systems of higher 
set theory by large cardinal theory 

Here, PRA is primitive recursive arithmetic, a weak subsystem of second-order arithmetic; 
cf [Sim299]. 

Cf. [Rat99, §§2.5 & 2.7], 

Cf. [Steo82a] for a mathematical discussion of the connections between the theory of inner 
models of set theory and this remarkable fact. 

^ Cf. [Sim299], [Poh96], [Kah02], and [Kano 94]. For an example of a strength analysis by large 
cardinals, see Schindler’s paper in this volume where he computes the consistency strength 




14 



BENEDIKT LOWE 



The relations between the different aspects and contexts of complex- 
ity are large in number, and a short introductory article to this volume 
can’t list them all. The fact that we can only provide a very passing glance 
at the fascinating subject of complexity stresses the importance, the live- 
liness and vigour of the topic of the conference FotFS III. The slightly 
different but deeply related notions of complexity in the several areas 
discussed in this volume are entrenched within the research communities 
of mathematical logic and computer science and will serve as a bridge 
between the two subject areas for years to come. 



of the theory BPFA+“every projective set of reals is Lebesgue measurable” in terms of large 
cardinals. 
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Abstract. Many quantum algorithms can be analyzed in a query (oracle) model 
where input is given by a black box that answers queries and the complexity of 
the algorithm is measured by the number of queries to the black box that it uses. 
We survey the main results and techniques in this model, with an emphasis on 
lower bound methods. 



1 Introduction 

The modem version of the Church-Turing thesis asserts that any reason- 
able model of computation can be efficiently (with at most a polynomial 
slowdown) simulated by a probabilistic Turing machine. Quantum com- 
puting challenges this thesis from an unexpected direction. 

The Turing machine model captures our intuition about how a gen- 
eral computation might proceed. Thus, either the modem Church-Turing 
thesis is correct or there is something missing in our intuition. The miss- 
ing part is that our intuition is based on classical mechanics. Everything 
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that we encounter in our everyday life and all of today’s computers func- 
tion according to the principles of classical mechanics. Because of that, 
when we define an abstract model of computation like Turing machines, 
we are likely to incorporate some principles of classical mechanics into 
this model in an implicit form. 

By using axioms of quantum mechanics instead, one can define quan- 
tum Turing machines and other models of quantum computation. There 
is evidence that those models are more powerful than their conventional 
(classical) counterparts. Simon {cf. [Simo97]) showed an oracle problem 
which can be solved by a quantum algorithm in a polynomial time but 
requires exponential time classically. Shor {cf. [Shoi97]) gave a polyno- 
mial-time algorithm for factoring integers and discrete logarithms, prob- 
lems that are conjectured to be very hard classically. Since public -key 
cryptosystems like RSA are based on the hardness of those problems, 
building a quantum computer would undercut the foundations of today’s 
cryptography. 

Another important quantum algorithm is due to Grover {cf. [Gro296]). 
It speeds up a very general class of search problems quadratically. While 
this speedup is just polynomial, Grover’s algorithm applies to a large va- 
riety of problems which includes both NP-complete problems like satis- 
fiability or 3-coloring and search problems solvable in polynomial time. 

Almost all known quantum algorithms can be expressed in a query 
(oracle) model where the input is given by a black box which answers 
queries of a certain form. The algorithms that fall into this framework in- 
clude Grover’s algorithm, Simon’s algorithm, period-finding (a key sub- 
routine of Shor’s factoring/discrete log algorithm) and many others. One 
of the advantages of this model is that we can also prove lower bounds 
on quantum algorithms. 

In this paper, we describe the techniques for algorithms and lower 
bounds in the query model. We focus on Grover’s search algorithm and 
present the algorithm and three methods of proving that this algorithm is 
optimal. Then, we list results for various other problems. 

2 Definitions 

2.1 Notation 

We use © to denote XOR (exclusive OR). If x\ G {0, 1}, X 2 G {0, 1}, 
a;i©a ;2 denotes XOR of bits a;i,a; 2 . If xi G {0,1}”, 0:2 G {0, 1}”, a;i©a ;2 
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denotes bitwise XOR of n-bit strings x± and X 2 - We use + to denote the 
usual addition of integers. 

We use the O and 0 notation (cf. [CorLeiiRiv90]) standard in com- 
puter science. Let f{N) and g{N) be functions defined on positive inte- 
gers N and taking positive values. We say that / = 0{g) if there exists 
a constant c such that f{N) < cg{N). We say that / = o{g) if, for any 
c > 0, there exists Nq such that f{N) < cg{N) for all N > Nq. We say 
that / = fl{g) if there exists c > 0 and Nq such that f{N) > cg{N) for 
all N > Nq. 

2.2 Quantum computing 

We introduce the basic model of quantum computing. For more details, 
see the textbooks by Gruska (cf. [Grui99]) and Nielsen and Chuang (cf. 
[ChuNieOO]). 

Quantum states: We consider finite dimensional quantum systems. An 
n-dimensional pure^ quantum state is a vector \ip) E C" of norm 1. 
We use |0), |1), . . ., \n — 1) to denote an orthonormal basis for C”. 
Then, any state can be expressed as \ f>) = ai\i) for some gq E 

C, ai G C, . . ., a„_i G C. Since the norm of \'ip) is 1, = 1- 

We call the states |0), . . ., |n — 1) basis states. Any state of the form 
aj|i) is called a superposition of |0), . . ., |n— 1). The coefficient 
Gi is called amplitude of |i). 

A quantum system can undergo two basic operations: a unitary evo- 
lution and a measurement. 

Unitary evolution: A unitary transformation U is a linear transforma- 
tion on that preserves the £2 norm (i.e., maps vectors of unit norm 
to vectors of unit norm). 

If, before applying U, the system was in a state \ f), then the state 
after the transformation is U\f). 

Measurements: In this survey, we just use the simplest case of quan- 
tum measurement. It is the full measurement in the computational 
basis. Performing this measurement on a state \f) = ao|0) -f . . . -f 
Gn-i\n — 1) gives the outcome i with probability |ajp. The measure- 
ment changes the state of the system to |i). Notice that the measure- 
ment destroys the original state \ f) and repeating the measurement 

* There are also mixed quantum states but they are not used in this paper. For the rest of the 
paper, “quantum state” means “pure quantum state”. 
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gives the same i with probability 1 (beeause the state after the first 
measurement is |f)). 

More general elasses of measurements are general von Neumann and 
POVM (Positive Operator- Valued Measure) measurements {cf. 
[ChuNieOO]). 

2.3 Query model 

In the query model, the input x±, . . X]\f is eontained in a blaek box and 
ean be aeeessed by queries to the blaek box. In eaeh query, we give i 
to the blaek box and the blaek box outputs Xi. The goal is to solve the 
problem with the minimum number of queries. The elassieal version of 
this model is known as decision trees {cf. [BuhdWo02]). 



I i> 
lb> 



0 


1 




0 



X, 



X. 



X. 



Ii> 



lb-i-Xj> 



Fig. 1. Quantum black box. 



There are two ways to define the query box in the quantum model. 
The first is an extension of the elassieal query (cf. Figure 1). It has two 
inputs i, eonsisting of [log N~\ bits and b eonsisting of 1 bit. If the input 
to the query box is a basis state \i)\b), the output is \i)\b®Xi). If the input 
is a superposition the output is ^ aifi)\b © xf. Notiee 

that this definition applies both to the ease when the x* are binary and 
to the ease when they are /c-valued. In the /c-valued ease, we just make b 
eonsist of [log 2 k~\ bits and take b(B Xi to be bitwise XOR of b and Xi. 

In the seeond form of quantum query (whieh only applies to problems 
with {0, l}-valued Xi), the blaek box has just one input i. If the input is 
a state the output is ©(— While this form is less 

intuitive, it is very eonvenient for use in quantum algorithms, ineluding 
Grover’s seareh algorithm (cf. [Gro296]). 
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In this survey, we assume the first form as our main definition but use 
the seeond when deseribing Grover’s algorithm. This is possible to do 
because a query of the second type can be simulated by a query of the 
first type (cf. [Gro296]). Conversely, an oracle of the first type can be sim- 
ulated by a generalization of the sign oracle which receives ai^b\i)\b) 
as an input and outputs aj(— 

A quantum query algorithm with T queries is just a sequence of uni- 
tary transformations 

Ut-i -^O^Ut 

on some finite-dimensional space C^. The transformations Uq, Ui, ... ,Ut 
can be any unitary transformations that do not depend on the bits Xi, ... , 
xn inside the black box. O are query transformations that consist of ap- 
plying the black box to the first log N + 1 bits of the state. That is, we re- 
present basis states of as |i, b, z). Then, O maps \i, b, z) to |i, b®Xi, z). 
We use Ox to denote the query transformation corresponding to an input 
X = {xi, . . .,Xn). 

The computation starts with the state |0). Then, we apply Uq, Ox, • • • , 
Ox, Ut and measure the final state. The result of the computation is the 
rightmost bit of the state obtained by the measurement (or several bits if 
we are considering a problem where the answer has more than 2 values). 

The quantum algorithm computes a function f{xi, . . . ,xn) if, for 
every x = (ti, . . . , tat) for which / is defined, the probability that the 
rightmost bit of UtOxUt-i • • • OxUq\0) equals f{x \, . . . , tat) is at least 
1 — e for some fixed e < 1/2. 

The query complexity of / is the smallest number of queries used by 
a quantum algorithm that computes /. We denote it by Q{f). 

3 Grover’s search algorithm 

Consider the following problem. 

Problem 3.1. Input: x\ G {0,1}, X 2 G {0,1}, ..., x^ G {0,1} such 
that exactly one is 1. Output: the i such that Xi = 1. 

Classically, one needs ^1{N) queries to solve this problem and there 
is no better algorithm than just querying the locations one by one until 
we find Xi = 1. Surprisingly, there is a better algorithm in the quantum 



case. 
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Theorem 3.2 (Grover). There is a quantum algorithm that solves Prob- 
lem 3.1 with 0{\/N) queries. [Gro296] 

This is aehieved by the following algorithm {cf. [Gro296]). 

1. Apply a unitary transformation U mapping |0) to N)- 

2. Repeat for \^^/N~\ times: 

(a) Apply the query transformation O whieh maps to 

(b) Apply the following “diffusion operator” D 

(D\l) = -^\l) + ^\2) + ... + j^\N) 

I D|2) = ||l)-^|2) + ... + ^|iV) 

[d\N) = ^\1) + ^\2) + . . . - ^\N) 

3. Measure the state, output the result of the measurement. 

We note that Grover’s algorithm is effieient not just in the num- 
ber of queries but also in the running time. The reason for that is that 
the diffusion operator D can be implemented in O(logA^) time steps 
(cf. [Gro296]). Therefore, the whole algorithm ean be implemented in 

0(^VN log N) time. 




Fig. 2. Amplitudes before the first query 



Fig. 3. Amplitudes after the first query 
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Fig. 4. Amplitudes after D 



We now informally explain why this algorithm works. Our presen- 
tation follows the “inversion against average” of Grover {cf. [Gro297]). 
A different visualization of Grover’s algorithm (by two refleetions of a 
veetor) is deseribed in {cf. [Aha99]). 

To understand the algorithm, plot the amplitudes of |1), . . ., |A^) at 
eaeh step. After the first step, the state is K) the amplitudes 

are (Figure 2). After the first query, the amplitude of |i) with Xi = 1 
beeomes (Figure 3). Then, the diffusion operator D is applied. 

Let 1'^) = be the state before D. Then, the state after D is 

\f') = where a' = We ean rewrite this 

as a' = -at + and a • + a* = Y.J=i Let A = 

be the average of amplitudes a*. Then, we have a- + a* = 2A and, if a* = 
A + A, then a[ = A — A. Thus, the effeet of the “diffusion transform” 
is that every amplitude a* is replaeed by its refleetion against the average 
of all tti. 

In partieular, after the first query, the amplitude of |i) with = 1 is 
— ^ and all the other amplitudes are The average is 
whieh is almost Therefore, after applying D, the amplitude of |i) 
with Xi = I beeomes almost and the amplitudes of all other basis 
states I j) slightly less than (cf. Figure 4). 
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The next query makes the amplitude of |i) with x* = 1 approximately 
— The average of all amplitudes is slightly less than and reflect- 
ing against it make the amplitude of |i) with Xi = 1 almost 

Thus, each step increases the amplitude of |i) with Xj = 1 by 
0(1 /\/N) and decreases the other amplitudes. A precise calculation (cf. 
[Gro296], [Gro297]) shows that, after steps, the amplitude of |i) 
with Xj = 1 is 1 — o(l). Therefore, the measurement gives the correct 
answer with probability 1 — o(l). 

Boyer et al. in [Boy-i-98] have extended Grover’s algorithm to the 
case when there can be more than one i with a;* = 1. The simplest case 
is if the number of x* = 1 is known. If there are k such values, we can 
run the same algorithm with [ | N/k~\ iterations instead of [ | ^/N ] . An 
analysis similar to one above shows that this gives a random i such that 
Xi = 1 with high probability. 

A more difficult case is if k is not known in advance. The problem 
is that, after reaching the maximum, the amplitudes of |i) with Xi = 1 
start to decrease. Therefore, if we do too many iterations, we might not 
get the right answer. This problem can be handled in two ways. The 
first is running the algorithm above several times with a different num- 
ber of steps (cf. [Boy-i-98]). The second is invoking a different algorithm 
called “quantum counting” (cf. [BraiH0yTap98]) to estimate the number 
oi Xi = 1 and then choose the number of steps for the search algorithm 
based on that. Either of those approaches gives us solution to: 

Problem 3.3. Input: x\ G {0, 1}, X 2 G {0, 1}, . . xn G {0, 1}. Out- 
put: i with Xi = 1, if there is one, “none” if x* = 0 for all i. 

Theorem 3.4 (Boyer). There is an algorithm that solves the Problem 3.3 
with 0(y/N) queries. [Boy-i-98] 

Many problems can be solved by reductions to Problems 3.1 and 
3.3. For example, consider the satisfiability which is the canonical NP- 
complete problem. We have a Boolean formula F(ai, . . . , an) and we 
have to find whether there exists a satisfying assignment (ai G {0, 1}, 
■ ■ an G {0, 1} for which F(ai, . . . , a„) = 1). We can reduce the sat- 
isfiability to Problem 3.3 by setting N = 2'^ and defining xi, . . . ,a; 7 v 
to be F(ai, . . . , an) for N = 2^ possible assignments oi G {0, 1}, . . ., 
an G {0, 1}. 
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This means that we construct an algorithm that takes Oi, . . a„ and 
checks if F{ai , . . . , a„) = 1. Then, if we replace the black box in the 
Grover’s algorithm by this algorithm, we get an algorithm that finds a 
satisfying assignement in the time O(v^) times time needed to check 
one assignement. 

A similar reduction applies to any other problem that can be solved 
by checking all possibilities in some search space. 

4 Lower bounds for search 

A very natural question is: can we do even better? Is there a more com- 
plicated algorithm that solves Problem 3.1 with even less queries? If this 
was the case, we would get better quantum algorithms for all the prob- 
lems that reduce to it. In particular, an 0{\og^ N) query algorithm for 
Problem 3.1 would result in a polynomial- time quantum algorithm for 
satisfiability. 

The next theorem shows that the speed-up achieved by Grover’s al- 
gorithm is maximal. 

Theorem 4.1 (Bennett). Any quantum algorithm for Problem 3.3 uses 
queries. [Ben-i-97] 

There are several proofs for this theorem using different techniques. 

4.1 Classical adversary 

Theorem 4.1 was first shown by Bennett et al. (cf. [Ben-i-97]) using a 
classical adversary method (also called “hybrid argument”). In this proof, 
we assume that we are given a quantum algorithm A that uses o{\/N) 
queries. We then produce an input Xi, . . ., on which A outputs an 
incorrect answer. 

To do that, we first analyze the algorithm on the input x± = X 2 = 
. . . = xn = 0 on which it must output “none”. We find Xi such thaf 
the total probability of querying Xi at all steps is small. Then, we show 
that if we change Xi to 1 the final state of the algorithm does not change 
much. This implies that the algorithm either outputs the wrong answer 
on the original input x± = X 2 = ■ ■ ■ = xn = 0 or on the modified input 

xi = . . . = Xi-i =0,Xi = 1, Xi+i = . . . = Xn = 0. 
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We now formalize this argument. Let |^j) be the state before the i\h 
query (with the previous queries answered aeeording to Xi = ... = 
xat = 0). We deeompose this state as 

N 

l^i) 

with \j) being the index that is queried and \^jJij) being the rest of the 
state. For eaeh j, we eonsider the sum Si = \aij \ . This sum measures 
how mueh the algorithm queries the item Xj. Then, we have the following 
two lemmas. 

Lemma 4.2 (Bennett). Consider the algorithm A on inputs x± = . . . = 
xat = 0 and xi = . . . = Xi_i = 0, Xj = 1, Xj+i = . . . = xat = 0. The 
probability distributions on possible answers on those inputs differ by at 
most 8Si. [Ben+97] 

Therefore, if the algorithm outputs “none” on the input Xi = ... = 
X]\f = 0 with probability at least 1 — e, it also outputs “none” on xi = 
. . . = Xi-i = 0, Xi = 1, Xi+i = . . . = xn = 0 with probability at least 
1 - e - 8Si. 

Lemma 4.3 (Bennett). Let T be the number of queries. There exists 
i G {1, . . . , iV} with Si < [Ben+97] 

Therefore, if the number of queries is less than c\/N for a small eon- 
stant c, the probability of the algorithm A outputting “none” on x\ = 
. . . = Xi-i = 0, Xj = 1, Xj+i = . . . = Xjv = 0 is elose to 1 — £ for some 
i. This means that A is ineorreet on this input. 

4.2 Polynomials method 

The seeond approaeh {cf. [Beao+01]) uses a eompletely different idea. 
We deseribe the probability of the algorithm outputting a given answer by 
a low-degree polynomial p. If the algorithm eomputes / with probability 
1 — e, the polynomial approximates / within £. Then, the lower bound 
on the number of queries follows from a lower bound on the degree of 
the polynomial approximating /. The key result is 
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Lemma 4.4. Let |^) be the state of a quantum algorithm after k queries. 
Then, |i/>) = Y.i • • • , XN)\i), with ai{xi, being polyno- 

mials of degree at most k. [Beao-tOl], [For2Rogi99] 

At the end of the algorithm, the probability of the measurement giv- 
ing i is the square of the amplitude of |i). Since the amplitude is a poly- 
nomial of degree at most k, its square is a polynomial of degree at most 
2k. 

Consider the version of Problem 3.3 where the algorithm is only re- 
quired to find if there is = 1 or not. Let p{x \, . . . , tat) be the proba- 
bility of the algorithm outputting the answer “there exists = 1”. Then, 
by the argument above, p is a polynomial in . . . , tat of degree at most 
2T. If the algorithm is correct, this polynomial must satisfy the following 
properties: 

1. 0 < p(0, . . . ,0) < e; 

2. If Tj = 1 for some i, then 1 — e < p{xi , . . . , tat) < 1. 

The authors of [NisSze92], [Pati92] show that any polynomial satisfy- 
ing these constraints has degree 12 (V^). The first step of the proof is 
to use the symmetry to transform the polynomial in N variables into a 
polynomial in 1 variable (xi + X 2 + ■ ■ ■ + xn) of the same degree. This 
gives a polynomial p{x) such that 0 < p(0) < e but 1 — e < p{x) < 1 
for all X E Then, one can apply results from approximation 

theory (cf. [Che66]) to show that any such polynomial must have degree 

n{VN). 

4.3 Quantum adversaries 

The third approach is the “quantum adversary” method by the present 
author (cf. [AmboOO]). The main idea is to run the algorithm on an input 
that is a quantum state. That is, we consider a quantum system consist- 
ing of two registers Ha and Hj, with one register (Ha) containing the 
algorithm’s state and the other register (Hi) containing the input. At the 
beginning Ha contains the algorithm’s starting state (|0)) and Hi con- 
tains a state of the form ax\x) where x = (xi, T 2 , . . . , xn) ranges 
over all possible inputs. If the algorithm performs a unitary transforma- 
tion Ui, we just perform Ui on Ha- Queries O are replaced by unitary 
transformations O' that are equal to Ox on the subspace consisting of 
states \ f)\x), \ f) E Ha- 
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If the input register Ti/ contains a basis state |a;), then the state in 1~La 
at any moment is just the state of the algorithm on the input x. 

For example, consider the search (Problem 3.1). First, we look at the 
case when Hi contains the basis state |a;) with Xi = 1 and Xj = 0 for 
j 7 ^ i. The correct answer on this input is i. Therefore, the end state 
in Ha must be |i). This means that the algorithm transforms the starting 
state containing |0) in Ha register into the end state containing |i) in Ha- 

|0)|0_^10_^) ^ |i)|0_^10_^). (1) 

i—l N—i i—1 N—i 



We can also put a combination of several basis states |a;) into Hj. Then, 
we are running the algorithm on an input that is a quantum state. Since 
unitary transformations are linear, the end state of the algorithm on the 
starting state |0) oix\x) is just a linear combination of the end states 
for the starting states |0)|a;). For the search problem, we consider the 
starting state that is Yl,x ^1^)’ summation over all x that have ex- 
actly one Xi = 1. Then, by summing up (1) for i = 1, . . . , iV, we get 



E 



1 

Vn 



i-l N-i 



N 

E 






( 2 ) 

i-l N-i 



Notice an important difference between the starting state and the end 
state in (2). In the starting state, the Ha and Hi registers are uncorre- 
lated. For any state in Hi, Ha register contains the same state |0). In the 
end state, the two registers are perfectly correlated. For every of N basis 
states |0 . . . 010 ... 0) in Hi, Ha contains a different state. 

This is also true for other problems. In general, if the algorithm com- 
putes a function /, the parts of state in Hi that correspond to inputs x 
with one value of f{x) become correlated with parts of Ha which corre- 
spond to the algorithm outputting this answer^. 

To bound the number of queries, we introduce a quantity measuring 
the correlations between Ha and Hi. We show bounds on this quantity 
correlations in the starting state, end state and the maximum change in 
one query. This implies a lower bound on the number of queries. For 
more details, cf. [AmboOO]. 



^ Those correlations are often called quantum entanglement because they are different from 
correlations between classical random variables, cf. [ChuNieOO]. 
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4.4 Comparison of different methods 

While all three deseribed methods give the same lower bound 

for Problem 3.3, they are of different strength for other problems. 

Classieal adversary arguments ean be usually reeast into the quantum 
adversary framework. Thus, the quantum adversary method is a general- 
ization of the elassieal adversary. 

The strengths of the quantum adversary and the polynomials method 
are ineomparable. For many problems, both of them give optimal lower 
bounds. The polynomials method usually needs the problem to have a 
eertain degree of symmetry. The reason is that it is easier to deal with 
polynomials in one variable, rather than in N variables. For seareh and 
other symmetrie funetions, one ean eonvert an TV-variable polynomial to 
a one-variable polynomial. However, if the problem lacks symmetry, it 
might be difficult. 

On the other hand, the polynomials method is better for proving lower 
bounds for quantum algorithms with zero error or very small error (cf. 
[Buh-i-99]). It is also the only method which gives a lower bound for the 
collision problem (cf. [Aar02], [Shi02]). 

5 Other problems 

A variety of other problems have been studied in the query model. We 
now list some main results and open problems. 

5.1 Period-finding 

This is a class of problems for which quantum algorithms have achieved 
the most impressive speedups. Two examples are: 

Problem 5.1. We are given a black box computing a 2-1 function 

/:{o,ir ^{o,ir 

such that there exists x G {0, 1}", T 7 ^ 0” such that f(x®y) = f(y) for 
all y. We want to find x. [Simo97] 

Classically, this problem requires querying f(x) for values of 

X. In the quantum case, 0(n) quantum queries are enough (cf. [Simo97]). 
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This was the first example of an exponential advantage for a quantum 
algorithm. It led to Shor’s factoring algorithm {cf. [Shoi97]) which is 
based on the following black-box problem. 

Problem 5.2. We are given a black box computing a /c-to-1 function 

such that f{x) = f{x + k) for all x. We want to find k. [Shoi97] 

The problem can be solved with 0(1) queries, cf. [Shoi97], (com- 
pared to f2(v^) classically, cf. [CleOO]). 

Both of those problems (and several other problems) are particular 
cases of the Abelian hidden subgroup problem (cf. [EkeMoso98]). 

5.2 Finding colli s ions 

The algorithms for period-finding rely on the algebraic structure of the 
problem. If there is no such structure, the problem becomes much harder. 

Problem 5.3. We are given f : {1, ..., N} ^ {1, ..., N} such that / is 
2-1 and want to find x, y such that f(x) = f(y). 

Notice that Problem 5.1 is a particular case of Problem 5.3 with more 
structure (f(x) = f(y) if and only if y = x + z). The absence of this 
structure changes the complexity dramatically. While Problem 5.1 can be 
solved with 0(n) = 0(\ogN) queries, the best quantum algorithm for 
Problem 5.3 uses 0(N^^^) queries (cf. [BraiH0yTap97]). The algorithm 
uses Grover’s search algorithm. 

Proving a lower bound was open for four years, until Aaronson (cf. 
[Aar02]) showed an lower bound and Shi (cf. [Shi02]) improved 

it to fl(N^^^). The proof uses the polynomials method in a clever way. A 
related problem is element distinctness: 

Problem 5.4. We are given / : (1, . . . , A^} — {1, . . . , A^} such that 
there is a unique x, y such that x y and f(x) = f(y). We want to find 

x,y. 

There is a quantum algorithm that solves element distinctness with 
0(N^/^) queries (cf. [Buh-i-01]). It is based on a recursive use of Grover’s 
algorithm in a clever way. An Vt(N‘^/‘^) lower bound has been shown by 
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Shi (cf. [Shi02]). Closing the gap between those two results is an open 
problem. There is also a similar gap between upper and lower bounds for 
problems intermediate between 5.3 and 5.4 (functions / that have k pairs 
X, y such that f{x) = f{y)). 



5.3 Counting and approximate counting 

This is a class of problems studied by [Gro298], [BraiH0yTap98], 
[NayWu99] and others. Up to quadratic speedup can be achieved. An 
example is 

Problem 5.5. We are given xi e {0, 1}, X2 G {0, 1}, . . ., xn G {0, 1} 
and want to find k = Xi + . . . + x^. 

Classically, this requires Vt{N) queries. In the quantum case, 
0{\/{k + 1){N — k + 1)) queries are sufficient If k ~ N/2, this is the 
same as 0{N) and there is no speedup. However, if k is close to 0 or 
N, quantum algorithms achieve up to quadratic speedup. If k = 0(1), 
0{\J{k + l)(iV — /c + 1) becomes O(v^), but classical algorithms still 
need Vt{N) queries. 

For k ~ N/2, although quantum algorithms need the same number 
of queries as classical for exact counting, they are better for approximate 
counting. 

Problem 5.6 (e-approximate count). We are given x\ G {0, 1}, X2 G 

{0, 1}, . . ., Tat G {0, 1} and we know that either at least ^ + e fraction of 
them are 0 or at least ^ + e fraction of them are 1. We want to distinguish 
these two cases. 

This can be solved with quantum algorithms with 0{l/e) queries, 
compared with 0{l/e^) queries needed classically {cf. [Gro298]). Both 
exact and approximate counting results are tight. Matching lower bounds 
can be shown by either the polynomials method {cf. [NayWu99]) or the 
quantum adversary {cf. [AmboOO]). 

For more information, cf. [Brai+02], [Gro298], [NayWu99]. 



^ Cf. [BraiH0yTap98]. 
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5.4 Binary search 

Problem 5.7. We are given xi = . . . = Xi = 0, Xj+i = . . . = x^ = 1 
and want to find i. 

Classieally, log 2 N queries are needed. Quantum algorithms ean solve 
this problem with 0.53... log 2 iV (cf. [Fari+99]) or 0.65... log 2 iV 
(cf. [HpyNeeShiOl]) queries using two different teehniques. The best 
lower bound is 0.22... log 2 N (cf. [HpyNeeShiOl]) by a variant of “quan- 
tum adversary”. Thus, the speedup is at most a eonstant. It remains open 
to determine the exaet eonstant aehievable by quantum algorithms for 
this basie problem. 

Related problems are searehing and element distinetness, both of 
whieh have been studied in [HpyNeeShiOl]. 

5.5 General bounds 

There are elasses of problems for whieh we ean bound the maximum 
improvement over elassieal algorithms aehievable for any problem in this 
elass. The broadest sueh elass is the elass of total Boolean funetions. 

Let / : {0, 1}^ — {0, 1} be a total funetion of N variables xi G 
{0, 1}, . . xn G {0, 1}. Let Q(f) be the quantum query eomplexity 
of / and D(f) be the deterministie query eomplexity. Beals et al. in 
[Beao-i-01] showed that D(f) = 0(Q®(/)). Thus, for total funetions, 
quantum algorithms ean have at most a polynomial speedup. This eon- 
trasts with partial funetions for whieh an exponential speedup is possible. 
(An example is Problem 5.1 or Problem 5.2.) 

It is open what is the maximum possible speedup aehievable for total 
/. The best known example is the OR funetion 

xi OR X 2 OR . . . OR a; AT. 

Its deterministie query eomplexity is N but its quantum query eomplexity 
is 0(\/N) whieh is obtained by Grover’s algorithm. (Solving Problem 
3.3 is enough to eompute the OR funetion. If we find any Xi = 1, OR of 
all Xi is 1. If there is none, OR is 0.) 

An exeellent survey on this topie is [BuhdWo02]. 
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5.6 Other results 

Van Dam in [vDa98] has shown that any function of N 0,1 -valued vari- 
ables can be computed with N/2 + o{N) queries. This is optimal because 
it is also known that any quantum algorithm needs N / 2 queries to com- 
pute the parity of xi, ... (cf. [Beao-i-01], [Fari-i-98]). 

Buhrman et al. in [Buh-i-99] have studied the number of queries nee- 
ded for zero-error quantum algorithms and small-error quantum algo- 
rithms. They showed that, for several problems, the speedup achieved by 
quantum algorithms disappears in the zero-error model. Thus, requiring 
a quantum algorithm to give the correct answer with probability 1 — e 
(instead of 1) can be important for the construction of good quantum al- 
gorithms. They also studied the tradeoff between the decrease in the error 
and increase in the number of queries for the search problem. 

The present author and de Wolf in [AmbodWoOl] studied the average- 
case query complexity when we allow the algorithm to use different num- 
bers of queries on different inputs and count the expected number of 
queries, under some distribution over possible values of Xi, . . . , x^. 

6 Beyond the query model 

A related area is quantum communication complexity. In the standard 
model of communication complexity (cf. [KusNis97]), we have two par- 
ties (often called Alice and Bob) computing a function f{x, y) of two in- 
puts, with the input x known to one party (Alice) and the input y known 
to the other party (Bob). Since neither party knows both inputs, commu- 
nication between the two parties is needed to compute /. The commu- 
nication complexity of / is the minimum number of bits that need to be 
communicated between Alice and Bob to compute /. 

The speedups for problems in the query model often imply similar 
speedups for communication problems. For example, the 0(V^)-query 
algorithm for problem 3.3 can be used to construct a communication pro- 
tocol that solves the following problem (called set disjointness) by com- 
municating just 0{^/W\ogJ7) quantum bits"^ between Alice and Bob (cf. 
[BuhCleWig98]). This provides almost quadratic improvement over the 
classical case because, classically, fl{N) bit communication is needed to 
solve this problem (cf. [KalSch592], [Razi92]). 

The result has been further improved to 0(-\/N c*°® ") hits by [H0ydWoO2]. 
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Problem 6.1. Alice is given a subset A C {1, . . . , A^}, Bob is given 
i? C {1, . . . , N}. They want to find out if A and B are disjoint. 

Query algorithms imply corresponding communication algorithms 
but proving lower bounds for quantum communication complexity is 
more difficult (because communication complexity is a more general set- 
ting). Proving a good lower bound for Problem 6.1 was open for about 
five years until, very recently, Razborov in [Razi03] showed that Q(\/]V) 
bits of communication are needed. The proof uses polynomials method 
in a combination with several other ideas. 

For a survey on quantum communication complexity, cf. [dWo02] . A 
good book on classical communication complexity is [KusNis97]. 
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Abstract. Let R{A) denote the rank (also called bilinear complexity) of a finite 
dimensional associative algebra A. A fundamental lower bound for R{A) is the 
so-called Alder-Strassen bound R{A) > 2 dim A — t, where t is the number 
of maximal twosided ideals of A. The class of algebras for which the Alder- 
Strassen bound is sharp, the so-called algebras of minimal rank, has received 
a wide attention in algebraic complexity theory. A lot of effort has been spent 
on describing algebras of minimal rank in terms of their algebraic structure. We 
review the known characterisation results, study some examples of algebras of 
minimal rank, and have a quick look at some central techniques. We put special 
emphasis on recent developments. 



1 Introduction 

How many operations are necessary and sufficient to multiply two matri- 
ces? Or two triangular matrices? How about two polynomials modulo a 
third fixed one? These questions are examples of perhaps the most funda- 
mental problem in algebraic complexity theory, the complexity of multi- 
plication (in associative algebras). To be more specific, let A be a finite 
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dimensional associative /c-algebra with unity 1. (In the following, we will 
often speak of an algebra for short.) We fix a basis of A, say vi, ... ,v^, 
and define a set of bilinear forms corresponding to the multiplication in 
A as follows: If 

N 

^ for 1 < /i, z/ < iV 

K=\ 

(k) 

with structural constants E k, then these constants and the identity 

( TV \ ^ ^ 

E E = E 

fl=l ) V U=1 ) K=1 

define the desired bilinear forms bi, ... ,bN- The bilinear complexity or 
rank of these bilinear forms 6i, . . . , 6 tv is the smallest number of essential 
bilinear multiplications necessary and sufficient to compute bi, ... ^b^ 
from the indeterminates Xi, . . . , and Yi, ... , Yjq. More precisely, 
the bilinear complexity of &i, . . . , 6 at is the smallest number r of prod- 
ucts pg = Ug{X) ■ VgiY) wlth Imcar forms Ug and Vg in the X/s and 
Yj's, respectively, such that &i, . . . , are contained in the linear span of 
Pi, ... ,Pr. From this definition it is obvious that the bilinear complexity 
of 6i, . . . , does not depend on the choice of wi, . . . , uat. Thus we may 
speak about the bilinear complexity of (the multiplication in) A. For a 
modem introduction to this topic and more background, the reader is re- 
ferred to [BiirClaShoo97]. For more details on associative algebras, we 
recommend [DroKir94], 

A central lower bound for the rank of an associative algebra A is the 
so-called Alder-Strassen bound (cf. [AldStrSl]). It states that the rank of 
A is bounded from below by twice the dimension of A minus the num- 
ber of twosided ideals in A. This bound is sharp in the sense that there 
are algebras for which equality holds. Since then, a lot of effort has been 
spent on characterising these algebras in terms of their algebraic stmc- 
ture. There has been some success for certain classes of algebras, like 
division algebras, commutative algebras, etc., but up to now, no general 
result holding over all fields is known. Recently, some progress has been 
made, namely a complete characterisation of all semisimple algebras of 
minimal rank and of all algebras over algebraically closed fields has been 
provided (cf. [BlaOO]). The latter results can also be generalised to per- 
fect fields (cf. [BlaOl], [Bla02]). In contrast to prior results, these classes 
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of algebras also contain algebras such that the quotient algebra Aj rad A 
has factors that are not division algebras. Here rad A denotes the radical 
of A which is defined as the intersection of all maximal twosided ideals 
of A. 

Closely related to the bilinear complexity is the multiplicative com- 
plexity. Here we consider products p\ = U\{X, Y) ■ V\{X, Y) where u\ 
and v\ are linear forms both in the Xj’s and the Yj’s. The multiplicative 
complexity is clearly a lower bound for the bilinear complexity and it is 
easy to show that twice the multiplicative complexity is an upper bound 
for the bilinear complexity, see, e.g., [BurClaShoo97, Chapter 14]. 

1.1 Model of Computation 

Next we give a coordinate-free definition of rank. Such a definition turns 
out to be more appropriate in the following. 

For a vector space V, V* denotes the dual space of V. 

Definition 1.1. Let k be a field, U, V, and W finite dimensional vector 
spaces over k, and (p : U x V ^ W he a bilinear map. 

1. A sequence (3 = (/i, 5 ( 1 , wi, . . . , fr, gr, Wr) such that e U*, G 
V*, and Wq eW is called a bilinear computation of length r for p if 

r 

p{u, v) = '^ fg{u)gg{v)wg for all M e f/, n e V. 

e=i 

2. The length of a shortest bilinear computation for p is called the bilin- 
ear complexity or the rank of p and is denoted by R{p). 

3. If A is a finite dimensional associative /c-algebra with unity, then the 
rank of A is defined as the rank of the multiplication map of A, which 
is a bilinear map A x A ^ A. The rank of A is denoted by R( A). 

The products fp(u) ■ gg{v) stand in one-to-one correspondence with 
the products pg defined above. Definition 1.1 also explains the alternate 
name rank: One can identify Bil(f/, V; W), the set of all bilinear maps 
U xV ^ W, with U* ®V* ® W. There the triads fg®gg® Wg are rank 
one elements. Just as the rank of an ordinary matrix M is the minimal 
number of rank one matrices such that M is contained in the linear span 
of these rank one matrices, the rank of a bilinear map p is the minimal 
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number of rank one triads such that if (more precisely, the tensor of cp) 
can be written as the sum of these triads. 

If we allow that fg and are linear forms onU x V m the above 
definition, we get quadratic computations and multiplicative complexity. 

With the introduced terminology, the Alder-Strassen bound can be 
rephrased as follows: 

R{A) > 2 dim A — # of maximal twosided ideals of A (1) 
for any algebra A. 

Definition 1.2. An algebra A is called an algebra of minimal rank if 
equality holds in (1). 

One prominent example of an algebra of minimal rank is the 
algebra of 2 x 2-matrices. Strassen [Str69] showed < 7. On 

the other hand, is a simple algebra, i.e., it has only one maximal 
twosided ideal. For a long time it had been open whether is of 
minimal rank or not, see [BiirClaShoo97, Problem 17.1]. The idea was 
that if one could characterise all algebras of minimal rank in terms of 
their algebraic structure, then one simply has to verify whether has 
this structure or not. Meanwhile, we know that k^^^ is not of minimal 
rank [Bla99]. Nevertheless, the characterisation of the algebras of min- 
imal rank is an interesting and important topic in algebraic complexity 
theory on its own, see, e.g., [Str90, § 12, Problem 4] or [BiirClaShoo97, 
Problem 17.5]. 



1.2 Organisation of the Article 

The organisation of the article follows closely the structure of the in- 
vestigated algebras. We first treat division algebras, then local algebras, 
commutative algebras, and basic algebras. These results are the “old” re- 
sults in the sense that the characterisation results for these classes are 
known for more than fifteen years. After that we turn to the new results, 
starting with simple algebras. After that, we study semisimple algebras 
and finally consider arbitrary algebras over perfect fields. We conclude 
this article with some interesting open problems. 
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2 Division Algebras 

The easiest example to study in the context of bilinear complexity are di- 
vision algebras. Recall that an algebra is called a division algebra if all of 
its nonzero elements are invertible. For this class, Fidducia and Zalcstein 
in [FidZaloVV] proved one of the first lower bounds for the complexity 
of associative algebras, namely R{D) > 2dimD — 1 for any division 
algebra D. This is nothing else than the Alder-Strassen bound for divi- 
sion algebras, since division algebras have only one maximal twosided 
ideal, the zero ideal. Division algebras are also the first class of algebras 
of minimal rank for which a characterisation in terms of their structure 
was obtained (cf. [dGr83]). 

Assume that D is a division algebra of minimal rank. Furthermore, 
let j3 = (/i, gi,wi,..., fr, Qr, Wr) bc a bilinear computation for D where 
r = 2 dimD — 1. We claim that /i, . . . , /^ generate D* (as a vector 
space). If this was not the case, then there would be some nonzero x G 
ker fg. But this would mean 

r 

x = x-l = Y^ f^{x)gQ{l)wi, = 0, 

£>=1 

a contradiction. Thus without loss of generality, /i, . . . , /at generate D* 
(with N = dim D). Let xi, . . . ,xn be the dual basis of /i, . . . , /at. By a 
technique called “sandwiching” (see, e.g., [dGr86]) we may also assume 
that xn = 1. Loosely speaking, sandwiching allows us to replace the ba- 
sis xi, . . . , xat by axib, . . . ,axNb for any invertible elements a, 6 G D. 
This of course changes also the computation f3. But still (3 is a computa- 
tion for D. 

Now g^: ■ ■ ■ , 172 A - 1 generate D*, too. If this was not the case, then 
we could find some nonzero y G ker But then 

2A-1 

y = XN-y= ^ f(,{xN)gg{y)wg = 0, 

£1=1 

since x^ E b;er fi by the definition of dual basis. Once more, we 
have obtained a contradiction. Let i/at, |/i, . . . , yN-i denote the dual basis 
of 77 a, 77A-I-1, • • • , 772A-1- (The shift of the indices is introduced for con- 
venience.) Again we also can achieve that y^ = 1. This does not change 
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Xi, ... ,xn- Exploiting the definition of dual basis, we obtain 

2N-1 

Xi-1= ^ fg{xi)gi,{l)wi, = gi{l)wi for 1 < i < iV - 1, 
e=i 
2A-1 

1 ■ %■ = fs(^)9g{yj)wg = fN+j{l)wN+j for 1 < j < iV - 1. 
e=i 

The factors gi{l) and /jv+j(l) are all nonzero and thus we may assume 
that they all equal one by scaling the products appropriately. It follows 
that 



2 A -1 

£>=1 

= 9i{yj)Wi + fN+j(Xi)wN+j 

e k ■ Xi + k ■ yj for 1 < i,j < N — 1. (2) 

A pair of bases 1, xi, . . . , xn-i and 1, yi, . . . , yN-i such that (2) holds is 
called a multiplicative pair or M-pair for short. Such pairs play a crucial 
role in the characterisation of algebras of minimal rank for which the 
quotient A/ rad A consists solely of factors that are division algebras. 

In the case of a division algebra, we can proceed as follows: Assume 
that D is not simply generated. Consider D as a vector space over K : = 
k{xi). Since D is not simply generated, we have d ;= dim^i- D > l.By 
a suitable permutation, we can achieve that y±, . . . ,yd is a basis of the 
A- vector space D. Now (2) implies 



xi ■ yi, . . . ,xi ■ yd e k ■ xi + k ■ yi f k ■ yd- 



But over k, xi ■ yi,. . . ,x\ - yd and yi,. . . ,yd are linearly independent. 
Comparing dimensions we get a contradiction, as d > 1. Thus D has to 
be simply generated, that is, it is isomorphic to k[X]/ {p{X)) for some 
irreducible polynomial p. This yields the following classification result. 

Theorem 2.1 (de Groote). A division algebra D over some field k is 
of minimal rank iff D is simply generated and #/c > 2 dim A — 2. In 
particular, A is a field. [dGr83] 
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The field k has to be large enough sinee the upper bound R{D) < 

2 dim D — 1 is obtained via polynomial multiplieation. Two univariate 
polynomials ean be multiplied modulo a third one of degree n with 2n — 
1 bilinear multiplieations iff > 2n — 2, as shown by Winograd in 
[Wino79]. To avoid sueh lengthy diseussion on the size of the field, we 
assume that in the following the underlying field k is always infinite. 

3 Local Algebras 

An algebra A is ealled loeal if A/ rad A is a division algebra. One exam- 
ple of local algebras of minimal rank can be easily obtained from those 
in the previous section. Let p be an irreducible polynomial of degree d 
in the indeterminate X over k. Then k[X]/ (p^) is a simply generated lo- 
cal algebra. For i = 1, this is even a simply generated division algebra, 
hence it is of minimal rank. For arbitrary i, k[X]/{p^) is an algebra of 
minimal rank, too. We have R{k[X]/{p^)) < 2 ■ di — 1 by utilising fast 
polynomial multiplication modulo p^. One the other hand, k[X]/{p^) has 
only one maximal twosided ideal, thus R{k[X]/ (p^)) > 2 ■ df — 1 by the 
Alder- Stras sen bound (1). 

A second example are the so-called generalised null algebras. A gen- 
eralised null algebra A is generated by nilpotent elements ei , . . . , such 
that 6i ■ 6j = 0 whenever i ^ j. Generalised null algebras are local, 
their maximal ideal is (ei, . . . , 6^)- Moreover, they are of minimal rank. 
Again the upper bound is obtained via univariate polynomial multipli- 
cation. Let us study this fact with a little example. Consider the three 
dimensional algebra T = k[X, F]/(X^, XY, F^). By the Alder-Strassen 
theorem, R{T) > 5. One the other hand, given two elements 

a + bX + cY and a' + b'X + c% 

we can multiply them with five bilinear multiplications as follows. We 
first multiply a + bX with a' + b'X (modulo X^). Note that this is noth- 
ing else than univariate polynomial multiplication. The result aa' + {ba' + 
ab')X can obviously computed with three multiplications. In the same 
way, we multiply a + cY with a' + c'Y . Again this needs three multi- 
plications. But we compute the product aa' for a second time. Hence we 
can save one multiplication and get the upper bound five. The same trick 
works for arbitrary generalised null algebras. 
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De Groote and Heintz showed that there are no further commutative 
local algebras. 

Theorem 3.1 (De Groote & Heintz). A commutative local algebra is 
of minimal rank iff it is simply generated or a generalised null algebra. 
[dGrHeii83] 

Biichi and Clausen extended this result to arbitrary local algebras. It 
turns out that local algebras of minimal rank are “almost commutative”. 

Theorem 3.2 (Biichi & Clausen). A local algebra A is of minimal rank 
iff it is simply generated or there exists a commutative local subalgebra 
B C A of minimal rank such that A = La + B = Ra + B. [BiicClaSS] 

Here and denote the left and right annihilator of rad A, which 
are defined by 



La = {x e rad A | a;(radA) = {0}} and 
Ra = e rad A | (radA)a; = {0}}. 

The proofs of the above results extensively utilize the concept of re- 
pairs of bases. 

3.1 Dependence on the Characteristic 

Consider the four dimensional local algebra S = k[X, F]/(X^, F^). We 
have S = k- l + k- X + k- Y + k- XY as vector spaces. Over fields 
of characteristic distinct from two, S is generated by the two elements 
X + Y mdX-Y, because {X + Yf = -{X -Yf = 2XY. Moreover, 
{X + Y){X — Y) =0 (in S). Therefore S' is a generalised null algebra 
and we have R{S) = 7. 

On the other hand, S is not of minimal rank over fields of charac- 
teristic two. If S would be an algebra of minimal rank, then it has to 
be a generalised null algebra. Take two nilpotent elements in S, say 
aX + bY + cXY and a'X + bY' + c'XY, that annihilate each other. 
We have 

{aX + bY + cXY){aX + b'Y + c'XY) = {ab' + ba')XY 

= {ab' - ba')XY 
= 0 . 
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But that means that the determinant ab' — ba' vanishes. In partieular, any 
elements that pairwisely annihilate eaeh other eannot generate both X 
and Y , a eontradietion. 

4 Commutative Algebras 

Commutative algebras A are isomorphie to a direet produet of eommuta- 
tive loeal algebras A = A^x - ■ - x At, a result due to Artin. In the previous 
seetion, we have already eharaeterised all loeal algebras of minimal rank. 
This would already eomplete the elassifieation of eommutative algebras 
of minimal rank provided that R(A) = J2t=i -R(^t) would hold. How- 
ever it is not known whether this holds in general. In faet, this is the so- 
ealled direct sum conjecture made by Strassen in [Str73]. It is elear that 
R{A) < Y^t=i holds, sinee we ean eombine the single eomputa- 

tions for eaeh Ar. Until today, we do not know any example where this 
inequality is striet. On the other hand, Sehonhage in [Seh781] eonstrueted 
an example for whieh striet inequality holds for a related quantity, the so- 
ealled border rank. (Aetually, his example did not involve multiplieation 
in algebras but multiplieation of reetangular matriees.) 

In our ease here, we are lueky. De Groote and Heintz in [dGrHeii83] 
showed that the direet sum eonjeeture holds for eommutative algebras of 
minimal rank. They even proved the stronger result that every eompu- 
tation of A splits into t eomputations for Ai, At. Summarising the 
thoughts above, we obtain the following eharaeterisation. 

Theorem 4.1 (De Groote & Heintz). A eommutative algebra is of min- 
imal rank iff it is isomorphie to a produet of eommutative loeal algebras 
of minimal rank as deseribed in Theorem 3.1. [dGrHeii83] 

5 Basic Algebras over Algebraically Closed Fields 

An algebra A is ealled basie if A/ rad A is isomorphie to a direet produet 
of division algebras. Sinee over algebraieally elosed fields k, the only di- 
vision algebra is k itself, we have A/ rad A = kj for basie algebras over 
algebraieally elosed fields. Again utilising the eoneept of M-pair, Heintz 
and Morgenstem in [HeiiMori86] were able to prove the following re- 
sult. 
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Theorem 5.1 (Heintz & Morgenstern). A basic algebra A over an alge- 
braically closed field has minimal rank if and only if there exist 
tci, . . . , Wm e rad A with WiWj = 0 for i ^ j such that 

rad A =La + {wi) H h {wm) = Ra + (wi) H h {wm)- 

Up to La and Ryi, basic algebras of minimal rank over algebraically 
closed fields are commutative. 

6 Simple Algebras 

All examples of algebras of minimal rank seen in the previous section 
base on polynomial multiplication. In this section, we study an example 
of a different kind, the algebra Recall that an algebra A is simple 
if its only maximal twosided ideal is the zero ideal. By Wedderburn’s 
theorem, this is equivalent to the fact that there exists a division algebra 
D and an integer n such that A = In other words, simple algebras 

are matrix algebras where the entries of the matrices are elements from 
/c-division algebras. 

By Strassen’s algorithm for multiplying 2 x 2-matrices, < 

7. On the other hand, R{A) > 2 dim A — 1 for any simple algebra by 
the Alder-Strassen bound. Hence, is of minimal rank. It turns out 
that this is the only simple algebra of minimal rank. In fact, the question 
whether for any other the Alder-Strassen bound is sharp, started the 
efforts in characterising the algebras of minimal rank. 

For algebras of the form it was finally proven in [Bla99] that 

> 2n^ + n - 3. (3) 

(This even holds for the multiplicative complexity.) It follows that for 
n > 3, is not of minimal rank. The technique for showing this 
result is the so-called substitution method due to Pan {cf. [Pan66]). We 
will exemplify this method at the end of this section. 

For simple algebras of the form with dim U > 2, it was shown 
in [BlaOO] that 

Ri^D^xn) > 2 dim for any n > 2. 

The methods are centered around the problem of finding elements a,b G 
£)nxn dim([a, b] ■ is large, where [a, b] = ab — ba de- 

notes the commutator or Lie product of a and b. Loosely speaking, the 
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“more noncommutative” an algebra is, the harder is multiplieation in this 
algebra. The methods are beyond the seope of the present artiele, the in- 
terested reader is referred to [BlaOO]. Altogether, the above arguments 
yield the following result. 

Theorem 6.1. A simple algebra A = is of minimal rank iff either 
n = 2 and D = k or n = 1 and D is a division algebra of minimal rank. 

The methods developed in [BlaOO] also yield the assymptoeially best 
lower bound for the rank of multiplieation in simple algebras 

^(^nxn) > |dimT)^^”-3n. 

By adjusting the eorreetion term — 3n appropriately, this bounds holds for 
arbitrary algebras, too. For semisimple algebras, this bound also holds for 
the multiplieative eomplexity. 



6.1 The Suhstitution Method 

As an applieation of the substitution method, we prove that > 7. 

The proof of (3) for larger formats is proven along the same lines, the ar- 
guments are however mueh more involved. Originally, one takes a eom- 
putation and restriets the eomputed bilinear map to say ker fi (one eoor- 
dinate is substituted by a linear form in the other ones). This trivialises 
one bilinear produet. Then one proeeeds with the remaining bilinear map 
in a reeursive fashion. We here present a refined approaeh, basieally due 
to Alder and Strassen. 

Let j3 = {fi, gi,wi, . . . , fr, gr,Wr) be a bilinear eomputation for 
Sinee the multiplieation is surjeetive, Wi, . . . ,Wr generate 
Let U and V be the veetor spaees 



[/ 




and V 




Above, the expressions on the righthand side denote the spaees of matri- 
ees obtained by substituting arbitrary elements from k for the Note 
that (7 is a right and is a left ideal. 

By “sandwiehing”, we ean aehieve that W\ ^ U but wi E U + V . 
We just have to find invertible matriees a and b sueh that a ■ wi ■ b has 
a zero as (1, 1) -entry and a nonzero value as (1, 2) -entry. We elaim that 
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Q2, ... ,Qr generate If this was not the ease, then there would be 

some nonzero x G ni=2 ^er gfj. But then 

a ■ X e k ■ wi for eaeh a G 

This eannot be the case, because ■ x is a left ideal and has dimen- 
sion at least two. Again by a suitable permutation, we can assume that 
92\v:93\v form a basis of V*. 

Next we show that U\u, ■ ■ ■ , fr\u generate U*. If this were wrong, 
then there would be some x G 1 / \ { 0 } such that x G fj- Let 

a G be arbitrary. Since 92\v,93\v form a basis of V*, there is an 
oi G such that 9j{ai) = 9j{a) for j = 2 , 3 . Therefore 

3 

x-{a- ai) = fi{x)gi{a - ai)wi + fj{x) gj{a - ai) wj 

r 

+ '^fi{x)gi{a-ai)wi. ( 4 ) 

Since x eU and U is a right ideal, x -a E U, too. As U is even minimal, 
X ■ = U. Because 14 is a left ideal and a\ G V, xai E U CiV holds. 

Thus ( 4 ) implies 

U = x- Ck-wi + UnV. 

Since wi ^ U, intersecting both sides with U yields U C U HV, a 
contradiction. 

Hence, we may assume that fi\u-, fb\u generate U*. Finally, we show 
that 5'2,fi'3,5'6; ■ ■ ■ , 9 r generate This yields r > 7 . Again, the 

proof is by contradiction. Assume that there is some nonzero 

r 

x' E kei g2 n kei gs fl ker^f^. 

j =6 

Since g2\v , . . . , gslv generate V*, we know that x' ^ V. The same rea- 
soning as before shows 

■ X C k ■ wi + U. 

By construction, the righthand side is contained inU + V. The left hand 
side, however, contains a matrix that has a nonzero entry in position 
( 1 , 1 ). Thus we have encountered a contradiction. 
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7 Semisimple Algebras 

An algebra is called semisimple if rad A = {0}. By Wedderbum’s the- 
orem, a semisimple algebra A is isomorphic to a product Ai x ■ ■ ■ x A* 
of simple algebras A^-. Note that t is exactly the number of maximal 
twosided ideals in A. 

In the course of the proof of their bound, Alder and Strassen showed 
that 

R(A X B) > 2 dim A — 1 + R{B) 

for any simple algebra A and any arbitrary algebra B (cf. [AldStrSl]). 
Applying induction, we get that a semisimple algebra is of minimal rank 
iff each of the simple algebras Ar is of minimal rank. Simple algebras of 
minimal rank have been characterised in the previous section. 

Theorem 7.1. A semisimple algebra A = Ai x ■ ■ ■ x A* is of minimal 
rank iff each simple factor A^. is of minimal rank, that is, A^- is either 
isomorphic to or is a division algebra of minimal rank. 

8 Algebras over Perfect Fields 

The most general characterisation result we know today, is the complete 
characterisation of all algebras of minimal rank over perfect fields^ Note 
that “most” fields are perfect, like all algebraically closed fields, all fields 
of characteristic zero, and all finite fields (which we exclude in this article 
for the sake of simplicity). 

Theorem 8.1. An algebra A over a perfect field k is of minimal rank iff 

A = CiX---xCsX X ■ ■ ■ X X A' (5) 

u times 

where the factors Ci , . . . , are local algebras having minimal rank with 
dim.{Ca/ radCc) > 2, A' is of minimal rank with A'/ rad A' = k^, and 
there exist wi, , Wm e rad A' with WiWj = 0 for i 7 ^ j such that 

rad A' = La' + (wi) H h (wm) = Ra' + (wi) H h (wm)- 

Any of the integers s, u, or m may be zero and the factor A' in (5) is 
optional. 

* Cf. [BlaOl] and [Bla02]. See also [BlaOO] for a characterisation over algebraically closed 
fields. 
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Over finite fields, a similar characterisation result can be obtained, 
too, see [BlaOl], [Bla02] for more details. 

It is interesting to note that algebras of minimal rank are composed 
solely from examples we have already known before. Loosely speaking, 
the only ways to obtain algebras of minimal rank are via polynomial 
multiplication and Strassen’s algorithm for multiplying 2 x 2-matrices. 

9 Open Problems 

The obvious open problem is the following: Determine all algebras of 
minimal rank over arbitrary fields! The reason why the results in the 
preceding section were only proven over perfect fields, is that the proof 
heavily relies on the Wedderbum-Mal’cev theorem^. From the Wedder- 
bum-Mal’cev theorem it follows, that any algebra A over a perfect field 
has a subalgebra B such that A/ rad A = B and B © rad A = A. This 
nice structure property is used in the proof of Theorem 8.1. One actually 
does not need that the field is perfect, one only needs the existence of a 
subalgebra B having the described properties. 

Another interesting problem -but probably a hopeless one in the gen- 
eral case- is the determination of the so-called algorithm varieties of an 
algebra. Basically we want to determine all optimal algorithms (up to 
equivalence) for multiplication in a given algebra. Of course, we have to 
know the rank of the algebra before we can start such a task, thus alge- 
bras of minimal rank are a natural candidate. A remarkable result is the 
proof by de Groote in [dGrV8] that up to equivalence, there is only one 
way to multiply 2 x 2-matrices with seven bilinear multiplications. It is 
beyond the scope of this survey to define the used notion of equivalence 
precisely, but we have already seen some examples: An easy one is the 
permutation of the products, a second one is the scaling of the factors of 
each product. “Sandwiching” is a third example. Further results in this 
direction are [dGr83] and [Feli86]. 

Note added in proof (December 2003). Meanwhile, the author obtained 
a complete characterisation of the algebras of minimal rank over arbitrary 
fields. It turns out that Theorem 8.1 is also valid over arbitrary fields. 



^ Cf„ e.g., [DroKir94], 
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Abstract. These are expanded notes of a lecture on iterated forcing theory. Par- 
ticular focus is put on recently developed techniques like constructions with 
mixed support, or Shelah’s iteration along templates. 



1 What iterated forcing is about 

Forcing, created by Cohen to prove the independence of the continuum 
hypothesis CH from ZFC, the standard axiom system for set theory, was 
soon transformed into a general and powerful technique to obtain a ple- 
thora of independence results. The aim of these notes is to provide a 
glimpse of goals, methods and problems of present-day forcing theory. 
While forcing can be used to get independence of combinatorial state- 
ments about larger cardinals as well, we shall focus on the continuum. 
So assume 99 is a statement about the combinatorial structure of the real 
line. 
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(A) A pattern encountered often is that ip is decided by CH, say, with- 
out loss, 

(i) CH ip 

while it is independent of c > K2, 

(ii) CON(c > Ml -f -i(^) 

CON(c >^i + p>) 

A typical example would be a statement like the real line is not the union 
of less than c many meager sets which is true under CH by the Baire 
category theorem' . 

(B) Some interesting statements are even independent of CH, e.g., 
Suslin’s hypothesis which asserts that every Dedekind-complete linear 
order without endpoints satisfying the countable chain condition c.c.c. 
(that is, every collection of pairwise disjoint non-empty open intervals is 
at most countable) is isomorphic to the real line. Then one typically gets 
both 

(i) CON(CH + ^(^) 

CON(CH + (^) and 

(ii) CON(c > Ki -k -199) 

CON(c > Ki + (^) 

(C) Non-trivial statements which are decided even by c = K2 while 
being independent of ZFC, z.e., 

(i) CH ^ 99 

(ii) c = 'i< 2 ^ ^ 

(iii) CON(c > K2 + -'V?) 

CON(c >^2 + ^) 

have been extremely rare so far. An example is the existence of a Gross 
space, that is, an uncountable-dimensional vector space E over a count- 
able field equipped with a symmetric bilinear form such that the orthog- 
onal complement of every infinite-dimensional subspace has dimension 
less than dim(i?) (see also below. Subsection 2 . 1 ). 

What is the reason for (C) occurring so seldom? — ^First of all c = K2 
is very weak combinatorially so that there is not much implied by it. Yet, 

* As usual c = |R| denotes the size of the continuum. 
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there is a deeper, forcing-theoretic reason: while we have a very good the- 
ory for making the continuum at most K2, i.e., for dealing with problems 
like (A)(ii), (B)(i)(ii), we are severely lacking in iteration theory when it 
comes to forcing c > K2. In fact, there are also a number of statements 
falling into patterns (A) or (B) for which 

(iii) CON (c > K2 -f 
CON(c >^2 + ^) 

is still open. For example, CH implies “there is a set of reals of size c 
which cannot be mapped continuously onto the reals”, and this is con- 
sistently true with c being of arbitrary size, while it is also consistent 
this fails and c = K2. However, the latter consistency is still open when 
c > K2 (cf. [Milo 93 ] or [MiloOO]). 

To get deeper into these problems, let us briefly recall what forcing 
does. It blows up a ground model M to a larger model M[G], called a 
generic extension, 

M ^ M[G] 

in the following fashion: in M, there is a partial order P such that G is 
P-generic over M, that is G is a filter and for all dense subsets D of P 
which belong to M, G P\ D ^ 0 . Except for trivial choices of P, such G 
can be shown not to belong to M, and M[G] then can be described as the 
minimal model containing both M and G. The generic extension process 
can be iterated, 

M ^ M[G] ^ M[G][H] 

where P G M, G is P-generic over M, Q G M[G] and H is Q-generic 
over M[G], There is an alternative way to describe this: namely, one can 
produce in M a partial order P a Q so that forcing with it does the same 
thing as forcing first with P and then with Q, that is, there is a G ^ H 
which is P A Q-generic over M such that M[G] [if] = M[GAii], and the 
two-step iteration can be desribed in one step. Continuing we get 

M ^ M[Gi] ^ M[G 2 ] ^ M[Gs] ^ ... ^ M[G^] 

where Pi = Qo, P2 = Pi aQi, P3 = P2AQ2, ... and the G* are Pj-generic 
over M. But what happens in the limit step? What is the model M[G^j? 
Or, to rephrase this, what is P^j? There may be many possible choices, 
the two simplest ones being the direct limit given by P^ = Un Pn, and 
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thus being the minimal possible choice, and the inverse limit which can 
be thought of as the collection of all “threads”, thus being the maximal 
possible choice. More explicitly, one thinks of as the set of all func- 
tions / with domain oj such that / \n forces (in the sense of Pn) that f{n) 
is a member of Q„. It should be clear that this iterative procedure can be 
continued as far as one wishes through the realm of the ordinals. 

1.1 Classical iterations 

Let us now assume we have an iterated forcing construction 

(Pc,,Q„; a < k) 

with direct limit P^ as sketched above. Here n is an arbitrary limit or- 
dinal; in most of the interesting cases, however, k will be a regular un- 
countable cardinal. There are two main and, by now, classical techniques 
which have been used extensively for adjoining real numbers and, thus, 
for getting consistency results of the form (ii) and (iii) above, namely 

(1) finite support iteration of c.c.c. forcing, the technique originally 
developed by Solovay and Tennenbaum to show the consistency 
of Suslin’s hypothesis mentioned above, 

(2) countable support iteration of proper forcing, created by She- 
lah [She98]. 

For the first, the direct limit is taken everywhere so that this can be 
thought of as forcing with finite partial functions, while in the second, 
we use the inverse limit at ordinals of countable cofinality and the direct 
limit elsewhere which essentially amounts to forcing with countable par- 
tial functions. Some care has to be taken as to what kind of forcing can 
be iterated with which technique so as to avoid collapsing cardinals. For 
example, if countably many have an uncountable antichain (are not 
c.c.c. ), the direct limit will collapse Ki so that finite support iteration is 
ruled out. This is why one uses it only for c.c.c. partial orders. The latter 
class in fact is closed under finite support iterations so that we can iterate 
as long as we wish without collapsing cardinals. Similarly, since count- 
able support iteration is closed under proper forcing, the latter is a natu- 
ral choice for (2). A partial order P is called proper [She98] if for large 
enough x and every countable elementary substructure N of contain- 
ing P, if p G n P, then there is q < p which is (P, A^) -generic. Here 
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q is called (F, N)-generic if for every P-name for an ordinal 7 which 
belongs to TV, g Ih 7 G iV. (That is, if the name is in N, then q forces 
the evaluation of the name to be in TV.) See below. Corollary 2.2, for an 
illustration of how this definition works. The proper partial orders form a 
much larger class than the c.c.c. partial orders. On the other hand, every 
proper partial order preserves Ki. 

Both methods have rather obvious flaws. Let us explain this in some- 
what more detail. Cohen forcing C is the collection of all finite partial 
functions from oj to 2, ordered by extension, i.e., q < p iff q ^ p; more 
generally, given a cardinal A, Ca is the collection of all finite partial func- 
tions from A to 2. A generic for C is called a Cohen real. Now one has 

Fact 1.1. A non-trivial finite support iteration (Pq,,Qq; a < k) adds a 
Cohen real in each limit step of countable cofinality. More generally, it 
adds a generic for Ca at each limit step of cofinality A. 

Proof. The iteration being non-trivial means that Pq, forces that Qq, has 
two incompatible elements, say and q\. Let /r < k be an ordinal of 
countable cofinality. Without loss p = u. Let c be the P^^-name given by: 
p forces c(n) = 0 iff p\n forces that p{n) < (so p forces c(n) = 1 
iff p\n forces that p{n) and are incompatible). It is easy to see that 
c is forced to be C-generic over the ground model. The general case is 
analogous. q.e.d. 

A consequence of this is that if, say, we start with a model for CH, a 
is regular uncountable, and P^ forces c = k, then P^ also forces Martin ’s 
Axiom MA for C, that is, the statement that for every collection of < c 
many dense sets of C there is a filter G C C which meets them all. In- 
cidentally, the latter statement is equivalent to saying that the real line is 
not the union of less than c many meager sets (cov(Ad) = c in symbols), 
a property we considered above. More generally, P^ also forces Martin’s 
Axiom for Ca where \ < k. This is a stronger statement than just having 
MA for C, see below (Subsection 2.1). However, there are many situa- 
tions in which one wants to have cov(Af) small. So what do we do in 
that case? 

In a sense, (2) is a much more powerful technique when it comes to 
making c < K2 for it both avoids the flaw sketched above and applies 
to a much wider class of partial orders. However, it necessarily forces 
c < K2. To appreciate this, let C'^^ be the forcing for adding a Cohen 
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subset ofoji, that is, all countable partial functions from oji to 2 ordered 
by extension; similarly, for A > Ki, is the set of all countable partial 
functions from A to 2. Then 

Fact 1.2. A non-trivial countable support iteration (Pq,,Qo; a < k) 
adds a Cohen subset of oj\ in each limit step of cofinality oji. More gen- 
erally, it adds a generic for at each limit step of cofinality A. 

The proof is the same as the one of Fact 1.1. Furthermore 
Fact 1.3. Forcing with forces CH. 

Proof. Let {/q,:o;— *> 2; o;<c} enumerate the reals of the ground 
model. Note that given a condition p G and a < c, we can extend p 
to a condition q such that for some limit ordinal (3 < uji, g(/9+n) = fa{n) 
for all n, that is, q codes the real fa at f3. An easy density argument then 
shows that in the generic extension, all ground model reals are coded at 
some (3 by the generic function. This means that the size of the set of old 
reals has become Ki. On the other hand, being cc-closed, does not 
add any new reals so that c = Ki in the generic extension. q.e.d. 

Now assume (Pq, a < k) is a countable support iteration which 
adds new reals in each stage where k = U 2 + uji. Since reals are added, 
P^2 forces c = K2. By Facts 1.2 and 1.3, after the subsequent cci steps 
CH is forced so that K2 has been collapsed by stage n if not earlier. This 
shows that with a countable support iteration one can never go past c = 
K2, and this is the main reason there have been fewer consistency results 
with c > K3, a fact mentioned above. 

2 Recent developments 

2.1 Iterations with mixed support 

Constructions with mixed support have been used for making Martin’s 
axiom MA for large Cohen algebras Ca fail. We illustrate this with a 
simple paradigm due to Shelah [FucSheSou97], namely, adding Cohen 
reals with mixed support. This is a product construction, and no iteration 
in the strict sense, but the general method works along the same lines. 

Let K be a cardinal. The p.o. P = P^ consists of countable partial 
functions p : k x u ^ 2 such that {n; p{a, n) is defined} is finite for 
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all a. Denote by p{a, ■) the restriction of p to {a} x u. Set supp(p) = 
{ q :; p{a, ■) 7^ 0 }, the support of p, and define the ordering < by q < p 
iff g 0 p and [a G supp(p); g(a, •) ^ ■)} is finite. P has the 

following properties: 

- Assuming CH, P has the ^2-chain condition which entails that forc- 
ing with P preserves cardinals > K2 (this is a standard A-system 
argument). 

- P is proper so that forcing with P preserves Mi (Lemma 2. 1 and Corol- 
lary 2.2). 

- P adds K, many Cohen reals (this is easy) and thus forces cov(Af ) > 

K. 

- If <0 holds in the ground model, ^ holds in the generic extension (see 
below for definitions etc.). 

Say that q <0 pit q < p and q{a, •) = p{a, ■) for all a G supp(p). Note 
that <0 is an cu-closed partial order. The following argument is archetypal 
for constructions with mixed support. 

Lemma 2 . 1 . Let 7 be a P-name for an ordinal, p G P. Then there is 
q <0 P such that the set = {r < q; supp(r) = supp(g) and r 
decides 7} is predense in P below q. [FucSheSou 97 ] 

Proof. Suppose not. We recursively construct sets {p^; C < and 
( < cui} of conditions such that 

(i) Po <0 P and p^ <0 p^ for ^ > C, 

(ii) < p^, supp(g<^) = supp(p^), 

(iii) if g^(o;, ■) ^ P^a, ■), then a belongs to U5<c ®npp(p^), 

(iv) g^ decides 7, and 

(v) the q(; are pairwise incompatible. 

This can be done by assumption. For indeed, suppose we are at step ( 
of the construction, and p^, g^, ^ < C> have been produced. Set p'^ = 
P^- Clearly p'^ <0 p and, by assumption, we can find g^ < p'^ decid- 
ing 7 which is incompatible with all r < p'^ which decide 7 and have the 
same support as p'^. A fortiori, g^ is incompatible with all previous g^ so 
that (iv) and (v) are satisfied. Defining p^ such that it agrees with p'^ on 
supp(p'^) and with g^ on supp(g<^)\supp(p'^), we guarantee (i), (ii) and 
(iii). 
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Define F ; cui — cui by F{() = min{^ < (■, for all a, if g^(o;, ■) 7 ^ 
P(^(q;,-), then a G supp(p^)}. By (iii) and by definition of the partial 
order, F is regressive, i.e., F{() < ( for all (. Using Fodor’s Lemma we 
see there are a stationary set S' C c<;i and a Co such that F(C) = Co for all 
C G S'. By pruning further, if necessary, we may assume without loss that 
there is r G P, supp(r) = supp(p<^p), r < such that 5 '^ I'(supp(r) x 
uj) = r for all ( e S. This means, however, that all C ^ S', are 
pairwise compatible, contradicting clause (v). q.e.d. 

Corollary 2.2. P is a proper forcing notion. [FucSheSou97] 

Proof. This is a standard argument. We include a proof as an illustration 
of how properness works. Assume is a countable elementary substruc- 
ture of some containing P and a condition p. Let {'jn', n G a;} list the 
P-names for ordinals belonging to N. By repeatedly applying the pre- 
vious lemma inside N, we can construct a <o-decreasing sequence of 
conditions G P fl A^, <o-below p, such that is predense in P 

below Qn. (Notice, however, that the whole sequence {g„; n & uj} does 
not belong to N .) Put g = IJn We claim that g is a (P, A^)-generic 
condition. 

For indeed, assume r < g and r Ih 7 n = <^- Since is predense 

below g there is s G Dq^^A^^ compatible with r. Since s forces a value to 
7n, we must have s Ih 7^ = 5 . As Dq^^A^^ e N is countable, Dq^^A^^ C N 
and, a fortiori, s G N and 5 G N, as required. q.e.d. 

What is this good for? Recall the combinatorial principle <0 (dia- 
mond) asserts the existence of a sequence (A^ F a\ a < uji) such that 
for every subset B F uji there is a < cui with BHa = Aa- By J(t (club), 
we mean that there is a sequence (A^ F a cofinal; a < cui is a limit 
ordinal) such that for all uncountable F C cui there is a with Aa F B. 
Finally, | (''stick") says there is a family A of countable subsets of ui of 
size Ki such that every uncountable subset of uji contains a member of 
A. These principles play an important role in a number of applications 
of set theory to other fields of mathematics. 

The definitions immediately give the implications <0 ^ ^ t and 

<0 ^ CH ^ I . Furthermore, it is known that <) is equivalent to CH + J|k. 
The connection to forcing axioms is given by the easy 

Fact 2.3. If I holds, Martin’s axiom fails for 
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Proof. Let M be a model of (a large enough fragment of) set theory of 
size Ki containing a witness A for | . Note there can be no -generic 
subset B over M, for, by genericity, such B would not contain any count- 
able set of M, violating that .4, is a witness for | . This means there is no 
filter meeting the (Ki many) dense sets of of M. q.e.d. 

Using an argument very similar to Lemma 2.1, one proves that a wit- 
ness for <0 in the ground model still witnesses ^ in the generic extension 
via P. (In fact, <) is not necessary, for it can be shown that P generically 
adjoins a J|k-sequence anyway.) Thus, one gets a model where MA for 
countable partial orderings (for C) is true (equivalently, where the cover- 
ing number cov(Af ) for the meager ideal is large) while ^ holds which 
is a strong way of saying MA fails for by Fact 2.3. 

We close this subsection with a list of results which have been ob- 
tained by similar, albeit more complicated, methods. 

- Adjoining many random reals with mixed support (that is, a mixed 
support version of a large measure algebra), one gets a model where 
the covering number cov(AA) of the null ideal is large while Jit holds. 
[Breicx)a] 

- Proper non-c.c.c. iterations with mixed support. They are typically 
used to show the simultaneous consistency of ^ with something else 
(e.g., the additivity of the meager ideal^ add (Ad) being K 2 ), or to 
prove various versions of ^ may not coincide [DzaShe99]. In most 
models of this kind 0 = ^ 2 . 

- However, with a very sophisticated iterated mixed support construc- 
tion involving “historic” definition of the conditions, Shelah has 
shown the consistency^ of c = K 3 (or larger) and “there are no Gross 
spaces” [Shecx)a]. In this model, for a-centered partial orders 
and a version of ^ for o ;2 hold simultaneously (so that MA fails for 
Cuj 2 in a strong way). 

- There is a standard way of iterating a-centered forcing with mixed 
support making c arbitrarily large [BrciOOa]. However, Jit does in gen- 
eral not hold in the generic extension, and the method has no appli- 
cations so far. 



^ The additivity of the meager ideal add (At) is the size of the smallest family of meager sets 
whose union is not meager. 

^ The statement c = H2 is known to imply the existence of a Gross space [Breiooa], see above. 
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Problem 2.4. Is and add(TW) > K 2 (simultaneously) consistent? 

We expect consistency can be obtained by a mixed support construc- 
tion. 

2.2 Iterations along templates 

Let us step back for a moment and rethink what an iteration is after 
all. In most of the interesting cases used in applications, including the 
constructions with mixed support discussed above, we define, by a si- 
multaneous recursion, both the iteration Pq, (that is, the initial segment 
of the eventual forcing) and the iterands (with which we want to 
force at a certain step a). As discussed before, we thus get a system 
(1 Pq,Qo; a < k) where the connection between the Pq, and the Qq is 
given by Pq+i = Pq ^ Qq. The forcing we eventually force with is the 
direct limit P^ of the Pq’s. 

However, in utmost generality, an iteration is nothing but a system 
(Bj; f e I) of complete Boolean algebras and complete embeddings 
{cij ; Bj Bj; i < j) between them, where (/, <) is a directed index 
set. The forcing notion defined by this system is the direct limit of the 
Bj’s. This is all nice but this general approach has not proved useful at all 
so far. On the other hand, considering only iterations of the (Pq, Qq; a < 
K)-form seems to be too restrictive. Note, the latter are a special case of 
the former, satisfying additionally 

- wellfoundedness of the index set /, 

- / is linearly ordered, 

- the initial segments of the iteration and the new iterands are handed 
down simultaneously. 

Which of these conditions are needed? There is no a priori need for the 
second and third conditions, but the status of the first is somewhat differ- 
ent. While it can be avoided in a number of cases, e.g., 

- since adding Cohen reals is a commutative process, it is irrelevant in 
which order they are adjoined, and we may think of them as being 
adjoined along some illfounded order, 

- the same is true for random reals, if added with a large measure alge- 
bra, 

- illfounded iterations of Sacks forcing have been considered as well. 
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this is not true in general. In partieular, when it eomes to adding dominat- 
ing reals, illfounded iterations are forbidden, for there eannot be models 
of set theory (M„; n G oo) with Mn+i C M„ and reals fn dominat- 
ing over Mn sueh that (/*; i > n + 1) G M^, by a theorem of Hjorth 
(see [Brei02] for a more detailed diseussion of all this). 

As a paradigm for a method satisfying the first eondition but not the 
seeond and third, we briefly deseribe how to iterate Heehler foreing along 
a template. Iterating along a template is a novel teehnique, introdueed by 
Shelah [Shecx)b] to show the eonsisteney of c) < a (see also [Brei02]). 
Heehler forcing D eonsists of all pairs (s, /) where s G and / G 
with s C f. The order is given by (f, g) < (s, /) if f ^ s and 
g{n) > f{n) for all n. D is a e.e.e. (even a-eentered) foreing notion 
whieh generieally adds a dominating real, that is, a real d G uj‘^ sueh that 
for all / G of the ground model, d{n) > f{n) holds for almost all 
n. To see this simply set d = lj{s G {s, f) belongs to the generie 
filter for some /}. 

Definition 2.5. Let (L, <) be a linear order, and set = {y & L] y < 
x} for all X G L. {L,I) is a template if X is a family of subsets of L 
satisfying 

(1) 0,XgX, 

(2) X is elosed under finite unions and interseetions, 

(3) if 7/ < X belong to L, then there is A G X fl p{Lx) sueh that 
y ^ A 

(4) if A G X and x ^ L \ A, then A fl X^; G X, and 

(5) X is wellfounded under C, Le., there is a funetion Dp ; X — > On, 
reeursively defined by Dp(0) = 0 and Dp(A) = sup{Dp(i?) -f 
1; i? G X and i? C A} for A G X \ {0}. 

We are ready for the definition of the iteration. 

Definition 2.6. Assume (X,X) is a template. We define, for A G X, by 
reeursion on Dp(A) the p.o. PfA. 

- Dp(A) = 0, i.e., A = 0. LetPf0 = {0}. 

- Dp(A) > 0. Then P(A eonsists of all finite partial funetions p with 

domain eontained in A and sueh that, letting x = max(dom(p)), there 
is 5 G X n p{A n Lx) (so Dp(i?) < Dp(A)) sueh that p ((A fi L^) G 
F\B and p{x) = where sg G and /| is a Pfi^-name 
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for an element of sueh that p|'(A fl L^) ll-pfs C (so this 
means that p{x) is a P (5-name for a eondition in Heehler foreing D). 
The ordering of P(A is given by: q <pfA P if dom(g) D dom(p) and 

• either y = max(dom(g)) > x = max(dom(p)) and there is 
5 e X n p{A n Ly) such that p,q\{A fl Ly) G P(5 and q\{A fl 

Ly) p, 

• or X = max(dom(g)) = max(dom(p)) and there is 5 G X fl 
p{A n L^) such that p\{A n L^),q\{A n L^) G P(5, fpj^ are 
P 1'5-names, ((A n L^) <p^s p\{A n L^), C 4 , and q\{A n 
Lx) IhptB ''fS(n) < f^{n) for all n” (the last two clauses mean 
nothing but g ((A nXa;) IhpfB ''q{x) <jjp(a;)”). 

The first thing one needs to check is that this recursive definition 
makes sense at all, namely, that the orderings <p|^a defined along the way 
are transitive. To this effect, one proves by induction (simultaneously to 
the recursive definition) that the p.o.’s completely embed one into the 
other, i.e., that for A, 5 G X, A C 5, P (A is a subforcing of P|'5. 
The latter is crucial by itself, for it shows that we indeed deal with an 
iteration in the usual sense. While complete embeddability is a triviality 
for iterations of the (Pq, Qq,; a < /t)-form, this is not so anymore in the 
present context. 

In an iterated forcing construction like the one described above, the 
recursive definition of the iteration and the description of the iterands are 
separated in a translucent way. Namely, while the latter are indexed by 
members of X, the former is indexed by members from the wellfounded 
set X. In general, X is not linear, and X is not wellfounded. In fact, there 
is no reason at all to consider only iterands which are ordered linearly, 
and, using a somewhat more complicated definition, the above can be 
extended to the situation where X is an arbitrary partial order. 

Results obtained so far with this method include: 

( 1 ) The consistency of h < a [Shecx)b]. 

The dominating number c) is the size of the least X such 
that every function in is eventually dominated by a member of 
T . A family AX [uY is called almost disjoint if any two distinct 
members of A have finite intersection. The almost disjointness 
number a is the size of the least infinite maximal almost disjoint 
family. The proof goes roughly as follows: 
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Assume CH in the ground model. Then (T,X) will be a template 
such that L has a cofinal subset of size 0J2. This guarantees we 
get a cofinal family of Hechler reals of cofinality uj2 so that b = 
b = K2 will hold"*. Since 0 > b in ZFC, it suffices to prove that 
no almost disjoint family of size K2 can be maximal in the generic 
extension. Suppose A = a < k = LO2 + UJ2} were a name 
for such a family. Reordering the Aa, if necessary, and using CH 
and a A-system argument, we may assume the names A„, a < 092, 
are isomorphic. The exact definition of (L, X) now is such that an 
additional name can be found such that, for any (3 < n, the 
complete subalgebra generated by and A^ is isomorphic to the 
complete subalgebra generated by A^ and A^ for some (almost 
all) a ^ ( 3 , a < 0J2. This means A^ is forced to be almost disjoint 
from all A/j, so that A was not maximal. 

( 2 ) Shelah [Shecx)b] also showed the consistency of u < a (using the 
consistency of the existence of a measurable cardinal)^. Furthermore, he 
obtained that a can be singular (of uncountable cofinality). 

( 3 ) The number a can consistently be singular of countable cofinality 
(e.g., a = is consistent). [BrciOS] 

Let us close with two problems. 

Problem 2.7 (Shelah). Show the consistency of u < a on the basis of 
ZFC. [SheOO] 

We expect this can be done with a template- version of iterated Math- 
ias forcing. 

Problem 2 . 8 . Show the consistency of cov(A/’) = K3 and cov(Ad) = K2. 

While iterations along a template as sketched above seem not readily 
adaptable to this problem, we expect the solution to be along similar 
lines in the sense that it should be a recursively defined iteration whose 
iterands are given separately. 

The unbounding number fa is the least size of an unbounded family T C o;“, i.e., there is no 
single g £ which eventually dominates all members of T . 

^ The ultrafilter number u is the smallest size of a base of an ultrafilter on ui. 




60 JORG BRENDLE 

For more extensive lists of problems see either [Milo93] (with a re- 
cent update online with references: cf. [MiloOO]), or the more recent 
[SheOO] (in particular, Sections 2 and 3) or Section 5 in [BrciOl] both 
of which also contain a more extensive list of references than the present 
paper. 

Note added to the revised version (February 2002). Problem 2.8 
has been solved positively by the author since the original writing of this 
article.^ 



® Cf. [Breioob], 
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Quite often in mathematics, people deal with the problem of classi- 
fying objects up to some notion of similarity, like isomorphism, homeo- 
morphism, isometry, biembeddability, etc. This usually amounts to find- 
ing necessary and sufficient criteria for two objects in the domain of dis- 
cussion to be equivalent. In other words, the goal is to find a criterion $ 
which, applied to objects x, y, says that these are equivalent if and only 
if <h(a;) = $(?/). So, these values <h(a;) are called complete invariants 
for the equivalence. In order to be useful, this assignment x i— > <h(a;) 
should be explicitly definable. Moreover, particularly for classification 
problems arising in real analysis, topology or algebra, the class of ob- 
jects to be classified very often carries a natural structure turning it into 
a Polish space (separable, completely metrisable topological space). So 
the notions of similarity alluded to above are really equivalence relations 
on such spaces. Let me recall some examples to illustrate this. 

Universal Polish metric space. There is a Polish metric space U, 
called Urysohn space, such that every Polish metric space is isometric to 
a closed subspace of U. The collection F(f/) of closed subspaces of U, 
which is a standard Borel space endowed with the Effros Borel structure, 
can thus be regarded as the class of all Polish metric spaces. Relations 
among Polish metric spaces like homeomorphism or isometry are then 
equivalence relations on F(f/). Similarly compact metric spaces are iso- 
metric to compact subsets of U ; thus K(f7) (the Polish space of compact 
subsets of U) is the class of all compact metric spaces. 

Universal compact metrisable space. Hilbert cube [0,1]^ contains a 
topological copy of every compact metrisable space. So K([0, 1]^) is the 
Polish space of all compact metrisable spaces. Continua (that is compact 
connected non empty metrisable spaces) form a closed subset C([0, 1]^) 
of K([0, 1]^). These spaces are suitable for discussing questions con- 
cerning topological but not metric considerations about compact Polish 
spaces. 

Spaces of countable structures. If L = {Ri, fj,Ck \ i G I ,j G J,k G 
K} is a countable first order language and ar denotes the arity function, 
then L-structures with universe N can be coded as elements of the Polish 



i&l j&J 



space 
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The relation of isomorphism on countable L-structures is an equivalence 
relation on X^. This is the orbit equivalence relation of a continuous 
action -called logic action- of S^o, the symmetric group of N, on 
(given a group G acting on a set A, the orbit equivalence is defined by 
letting a, a' G A be related if and only if there is a g G G such that 
ga = a'). If 99 G Soo,x G X^, the result (^x of applying to a; is 
the countable L-structure obtained from x by relabeling the elements 
of the universe of the structure using (p. For example, if L consists of 
one binary relation symbol, then for all natural numbers n and m one 
has (fx{n,m) = x{p>~^{n),(p~^{m)). If ^ C is a Borel subclass of 
Xl closed under isomorphism, then the restriction of the isomorphism 
relation to A can be considered as well. 

It remains to decide how to compare, in some explicit way, differ- 
ent classification problems, that is different equivalence relations. There 
are several ways to do this; the theory of Borel reducibility is the conse- 
quence of the following choice. While encompassing most of the usual 
explicit constructions of mathematics, this choice allows to use many 
techniques from descriptive set theory. 

Let X, Y be Polish spaces and let E, F be equivalence relations de- 
fined on X, Y respectively. Say that E is Borel reducible to F if there is 
a Borel function g : X ^Y such that 

Vx, x' G X {xEx' ^ g{x)Fg{x')). 

This is denoted by <b F. Indeed, when this situation arises, in order 
to know whether x,x' G X are similar with respect to the notion of sim- 
ilarity E it is sufficient to look whether elements g{x) and g{x') in Y are 
similar with respect to F. So the classification problem for E is at most 
as complicated as the classification problem for F and this justifies the 
notation E F. Moreover, the requirement that g be Borel grants that 
this comparison is done in some explicitly definable way. Of course it is 
possible to study different notions of reducibility, like the ones obtained 
by requiring the function g above to be continuous or measurable with 
respect to some fixed measures on X, Y or to some a-algebras other than 
the Borel one. 

HE <B F but <B E), then the classification problem for E 
is strictly simpler than the one for F; this is denoted by E <b F. If 
E <B F, F <B E both hold, then E, F have classification problems of 
the same complexity; this can be written as E ~b F. 
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Note that this notion does not involve explicitly the topology on the 
spaces where the equivalence relations are defined, but just the a-algebra 
of Borel subsets. So these definitions may be extended to equivalence 
relations defined on sets where, rather than a Polish topology, it is more 
natural to find a cr-algebra of subsets that is the cr-algebra of Borel sets for 
some Polish topology. A set X together with such a a-algebra is called a 
standard Borel space. 

Since <b is reflexive and transitive, it is a preorder in the class of 
equivalence relations on standard Borel spaces. Identifying equivalence 
relations E, F whenever E ~b F, the relation <b induces a partial or- 
dering. The study of this partial ordering is the scope of the theory. 

1 Smooth equivalence relations 

The easiest case of a classification problem for an equivalence relation 
E ona standard Borel space X is when it is possible to assign complete 
invariants that are themselves concrete objects, namely elements of some 
Polish space Y . This means that there is a Borel function g \ X ^ Y 
such that, for x^x' G X,xEx' ^ g{x) = g{x')\ using the notation 
introduced above, E <b =y where =y (in the sequel simply denoted 
Y) is equality on Y . Such an equivalence relation E is called smooth 
or concretely classifiable. Note that such an E is Borel as a subset of 
The cardinality of the quotient space X/E can be finite, countably 
infinite or the continuum. Thanks to the Silver dichotomy, which asserts 
that for a II} equivalence relation E either E <b N or M <b E, smooth 
equivalence relations i?, F on X, F respectively are always comparable 
with respect to <b and they are ranked according to the cardinality of 
their quotient spaces: E <b F ^ card(X/F) < card(F/F). So, up to 
~B> there are countably many smooth equivalence relations, which can 
be listed as follows: 



0 <B 1 <B 2 <B . . . <B n <B . . . <B N <B K (1) 

(for notational uniformity the empty space and the unique equivalence 
relation on it are considered as well). 



Examples. 1. For n a positive natural number, let Xi{n x n, C) be the 
space of all n X n complex matrices. The equivalence relation of similar- 
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ity on M.{n x n, C) is a smooth equivalence relation: such matrices are 
classified by their Jordan normal form. 

2. Isomorphism for Bernoulli measure preserving automorphisms on 
[0, 1] is smooth by [OrnVO], since these are classified by their entropy, a 
real number. 

3. If U is the Urysohn space, the isometry relation on K(17) (that is 
isometry for compact metric spaces) is smooth {cf. [Groo99]). The situ- 
ation is very different for isometry on all of F((7). The latter has been 
classified precisely in [GaoKecOS]. 

Smoothness is perhaps the most desirable situation when dealing with 
an equivalence relation, in the sense that in this case one usually has a 
very good understanding of the equivalence. However this turns out to be 
a rather uncommon situation in mathematics. 

2 Countable Borel equivalence relations 

A countable Borel equivalence relation E on n standard Borel space X 
is an equivalence relation that is Borel as a subset of and such that all 
equivalence classes [x]e are countable. If F is a Borel equivalence rela- 
tion, then F is essentially countable if, for some countable Borel equiv- 
alence relation E, the inequality F <b E holds. Smooth equivalence 
relations give thus the easiest examples of essentially countable Borel 
equivalence relations. Other examples of countable Borel equivalence 
relations are orbit equivalences induced by Borel actions of countable 
groups on standard Borel spaces. In fact, this is the only actual case by 
the following theorem of [FeloMoooVV]. 

Theorem 2.1. If F is a countable Borel equivalence relation on a stan- 
dard Borel space X, then there are a countable group G and a Borel 
action G x X ^ X inducing F. 

A very interesting fact concerning Borel equivalence relations is the 
Glimm-Effros dichotomy which, in the following general form, is due to 
[HaroKecLou90]. 

Theorem 2.2. There is a non-smooth countable Borel equivalence re- 
lation Eq such that, for any Borel equivalence relation F, either F is 
smooth or Fo <b F. 
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So, for Borel equivalence relations, the chain (1) can be continued as 

0 <B 1 <B 2 <B • • • <B n <B • • • <B <B <B Eq (2) 

and this chain contains all Borel equivalence relations (up to ~b) that are 
Borel reducible to Eq. 

2.1 Recognizing non-smoothness 

When trying to classify an equivalence relation, a first step may be to 
show that it is not smooth. As for other similar problems, to show E is 
not smooth it is enough to find an equivalence relation E, already known 
to be non-smooth, and show E <b E. By the definition, this amounts 
to set up a construction that assigns to F-equivalent elements some E- 
equivalent objects but, to F-inequivalent elements, i?-inequivalent ob- 
jects must be associated. This last requirement often involves some kind 
of diagonalisation. As an illustration, a couple of combinatorial argu- 
ments are discussed here. 

Recall the discussion of the space of countable L-structures for a 
given countable first order language L and its Borel invariant subclasses. 
If G is a subgroup of S^o then the restriction of the logic action to G x 
(or G X A where A is the invariant class under consideration) induces 
a subequivalence relation which may be called G -isomorphism. An in- 
stance of this is when G = Rec is the group of recursive permutations 
and A = [m]~ is a single isomorphism class. If the subgroup G is not 
closed in S'oo, then the (right or left, equivalently) coset equivalence Cq 
induced by G on S'oo is not smooth, by [Mili77]. This will be compared 
with G-isomorphism on some isomorphism class [m]~. 

For G, H subgroups of S'oo, let Eq be the equivalence relation on S'oo 
defined by letting xEq y if and only if there are v? in G and a in H such 
that ipxa = y. 

Proposition 2.3. Let 77 be the automorphism group of m g X^ and G 
be a subgroup of S'oo- Then Eq ~b— g- 

Proof. Define * : S'oo ~ ^ [u]^A ^ letting x* be the model obtained 
by applying x to u. So * is continuous and surjective. To check that it is a 
reduction, let x,y e S'oo- If there are (p e G,a e H such that pxa = y, 
since a is in the stabiliser H of u, p is an isomorphism between x* and 
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y*. Conversely, let ^ G G be an isomorphism between x* and y*\ then 
y~^ipx is an automorphism a of u, so ipxa~^ = y. 

Sinee H is elosed, then * has a Borel right inverse witnessing =g<b 

Eq. q.e.d. 

Corollary 2.4. If m is a rigid strueture, then for any subgroup G C S'oo, 
Cg ~b— g- partieular, if G is not elosed, is not smooth. 

Proposition 2.5. Let G, H be eountable subgroups of Soo- Suppose G is 
elosed under the following operations (this happens, for instanee, when 
G = Rec): 

- if 99 G G and ; N ^ N is defined, for /c G N, by 

^\2k) = 2(f{k) , ip’{2k + l) = 2k + l, 



then ip' E G. 

- if p E G and if p'{k) = defines a bijeetion p' from N onto N, 
then p' E G. 

Suppose moreover that there exists a set A C N with the following prop- 
erties: 

1. A is if -invariant; 

2. A is infinite and eoinfinite; 

3. H aets freely on A. 

Then Gq ^b -^g • 

Proof. Distinguish two eases. 

Case 1. There is a finite orbit of the aetion of H on A. By eondition 
3, every orbit of if on A is finite with fixed eardinality m E N and 
there are infinitely many sueh orbits. Moreover, H must also eonsist of m 
elements: let H = {id, Ui, . . . , <Trn-i}- Let (a°}neN be a transversal for 
the aetion of if on A and, for n G N, 1 < /r < m, let = a/i(a°). Thus 
^ = {o^}(n,/i)eNxm- Let also N\t 4 = be an enumeration without 

repetitions. For every h < mlet (g^jnsN be an injeetive sequenee of odd 
numbers sueh that: 

- h^h' ^ {^nlnGN H 
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- for every h ^ 0, there is no function in G mapping to for all 

n e N. 

For X e Soo, define x* G Soo letting 

x*{bn) = 2x{n) , x*{a^) = q^. 

So * : Soo — ^ Soo,x I— > x* is a continuous injection. To check that it 
reduces Cg to Eq, let x,y E Soo- Assume cpx = y for some cp E G. 
Define 99' : N — N letting, for k E N, 

ip'{2k) = 2if{k) , <p'{2k + l) = 2k + l. 

Then ip' E G and p'x* = y*. Conversely, suppose px*a = y* for some 
p E G,a E H. Then, by evaluating on each a° and using condition 3 
above, a = id and px* = y*; in particular, p maps even numbers to even 
numbers and it is the identity on the odd numbers. Define P\k) = ^ 
for all natural numbers k; then p' E G and p'x = y. 

Case 2. Every i/-orbit on A is infinite. So it can be assumed that A 
consists of just one infinite iJ-orbit. Let N \ A = {&n}neN be an enumer- 
ation without repetitions. Let also X be the Polish space of bijections 
from A to 2N + 1. The claim is that 

E X Vcr E II \ {id} Vp E G 3a E A pzcr(a) 7^ z(a). 

Indeed, for every a E H \ (id), 99 G G, the set {;^ G X | 3a G 
A pza{a) 7^ z{a)} is open dense in X. Define * : 5*00 — Soo,x 1— > x* 
letting x*\a = z and x*{bn) = 2x{n) for all n. Then * is continuous; to 
check that it is the desired reduction let x,y E Soo- Suppose first px = y 
for some p E G and define </?' : N — N by letting, for k eN, 

p'{2k) = 2p{k) , p'{2k + l) = 2k + l. 

Then p' E G and p'x* = y* . Assume conversely that there are p E 
G,a E H such that px*a = y*. This implies that for all a G A one 
has pza{a) = z{a) and so a = id, whence px* = y*. As before, let 
p'{k) = for all /c G N; then p' E G and p'x = y. q.e.d. 



Corollary 2.6. Let L be the language for group theory and Gr(Z) be 
the class of groups isomorphic to Z. Assume G is a countable subgroup 
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of Soo satisfying the closure hypotheses of Proposition 2.5. Then 
Borel reduces to G-isomorphism on Gr(Z). Thus, if G is not closed, 
G-isomorphism on Gr(Z) is not smooth. In particular this holds for G = 
Rec. 

All technical effort in the proof of Proposition 2.5 was essentially to 
get rid of the presence of the group H, making it impossible for two ele- 
ments in the range of the reduction to be related because of the effect of 
some non trivial member of H. This made sure that if x,y were Cq- 
inequivalent, then their images x*,y* remained i?^-inequivalent. An- 
other illustration of this kind of arguments is the following. 

Proposition 2.7. Let G be a countable subgroup of Soo satisfying the 
closure hypotheses of Proposition 2.5 and let r : N ^ N be defined by: 

• for all even n, we let r(n) := n + 2; 

• r(l) := 0; 

• r(m) := m — 2 if m G 2N -f 1, m > 3. 

Then Cg <b where 



Vx, y G Soo {xE''q y 399 G G 3;^ G Z y)xr^ = y). 

Proof. Let {gnjnez = 2N -f 1 be an injective enumeration of odd natural 
numbers, indexed by integers, such that (a) forp G 2Z\ {0}, the function 
Qn ^ Qn+p is not the restriction of any element of G; and (b) for p G 
2Z -f 1, the function g 2 n ^ Q 2 n+p is not the restriction of any element of 

G. 

Define *: Soo ^ Soo,x 1 — > x* letting, for n G N: 

x*{6n) = 2x{2n); x*(6n -f 1) = q- 2 n-i] x*{6n + 2) = q 2 n, 
x*{6n -f 3) = q- 2 n- 2 ] x*{Qn -f 4) = g 2 n+i; x*{6n -f 5) = 2x{2n + 1). 

So * is a continuous injection; to check that it is actually a reduction, let 
x, p G Soo- Assume first (px = y for some ip E G and define ^jJ : N ^ N 
by: 



for all even m, we let -^(m) := 2(p(y); 
for all odd m, we let ^jJ{m) := m. 
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Then ip ^ G and ipx* = y*. Suppose conversely that ipx*r^ = y* for 
some G G, ^ G Z. It must be 2 : = 0 since otherwise requirements (a) 

and (b) would be contradicted. This implies that ip sends even numbers to 
even numbers and it is the identity on odd numbers; so define : N — > N 
by (p{n) = Then ip e G and cpx = y. q.e.d. 

Corollary 2.8. If G is a countable subgroup of Soo satisfying the closure 
hypotheses of Proposition 2.5, then the equivalence relation Go Borel 
reduces to G-isomorphism on total orderings of type (, the order type of 
the integers. In particular, this holds for G = Rec. 

Proof. Let L = {R} he the language for order theory and let m G = 
2^^ be defined by letting u{n, m) = 1 if and only if one of the following 
holds: 

(n, m G 2N and n < m) or (n G 2N + 1 and m G 2N) 
or (n, m G 2N + 1 and m < n). 

Then [m]^ is the class of total orderings isomorphic to C and the automor- 
phism group of u is where r is as in Proposition 2.7. Now apply 

Propositions 2.3 and 2.7. q.e.d. 

In particular, the following has been proved. 

Theorem 2.9. The relations of recursive isomorphism on rigid struc- 
tures, groups isomorphic to Z and total orderings of order type C are 
non smooth countable Borel equivalence relations. 

2.2 Hyperfinite equivalence relations 

Definition 2.10. An equivalence relation i? on a standard Borel space 
X is called hyperfinite if there is a sequence En of Borel equivalence 
relations on X with finite classes such that 

• i7„ C En+i for all n G N; 

• E = UneN 

The following characterisations of hyperfinite equivalence relations 
are very useful (cf. [DouJacKec94]). 

Theorem 2.11. The following are equivalent for any countable Borel 
equivalence relation E on standard Borel space X: 
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1. E is hyperfinite; 

2. E is hypersmooth, that is E is the increasing union IJneN where 
each En is a smooth equivalence relation; 

3. E <B E^; 

4. there is a Borel action of Z on X inducing E', 

5. there is a Borel partial ordering < defined on X whose restriction 
to each equivalence class is a total order which is finite or has order 
type C- 



Examples. 1. Let elements x,y e 2^ be equivalent if they are eventually 
equal, that is there is n G N such that, for all m >n, x{m) = y{m). This 
hyperfinite equivalence relation is the one usually denoted Eq. 

2. Again for x,y e 2^, let x E^ y if x,y have the same tail, that is 
there are n, m G N such that, for all A; G N, x{n + k) = y{m + k). 

3. For x,y X Ey y X — y & Q. This is the Vitali equiva- 

lence relation. 

Note that Eq ~b -E^t ~b Ey, since they are hyperfinite but not 
smooth. 

Open problems 1. [DouJacKec94] The increasing union problem. Let 
X be a standard Borel space and suppose that, for each n e N, E^ is a 
hyperfinite equivalence relation on X. Suppose further that En C En+i 
for all n.ls E = IJneN hyperfinite? 

This is a very interesting structural open problem. The question is 
spontaneous and simple to state but apparently very hard to solve. Among 
the various instances of this problem let me single out the following. 

2. [JacKecLou02] Commensurability. For x, y G define xE^y 
^ G Q^. Is Ec hyperfinite? 

The relation Ec is the relation of commensurability. It is one of the 
most ancient equivalence relations in the history of mathematics and thus 
it appears natural to try and settle its classification problem. Moreover, it 
is the multiplicative analogue of Vitali equivalence relation. To observe 
that this is indeed a particular case of Problem 1, note that the multi- 
plicative group Q+ is isomorphic to the additive group The latter 
is the increasing union of subgroups isomorphic to Z” for n G N and it 
can be shown that Borel actions of such groups on standard Borel spaces 
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induce hyperfinite equivalence relations. On the other hand, classifica- 
tion of Ec would likely provide clues for the solution of the more general 
Problem 1. 

3. (Weiss) Borel actions of countable Abelian groups. Let G be a 
countable Abelian group and Eq be the orbit equivalence relation in- 
duced by a Borel action of G on some standard Borel space. Is Eg hy- 
perfinite? 

Since every countable Abelian group is homomorphic image of Q"*", 
an affirmative answer to Problem 1 would imply an affirmative solution 
for this. In turn, if this holds. Problem 2 would also be solved in the af- 
firmative. Moreover, for any countable group El, every orbit equivalence 
relation induced by a Borel action of H on some standard Borel space 
Borel reduces to the equivalence relation on P(Z x H), the power set of 
7jX H, induced by the translation action ofExH. Since Q+ ~ Z x Q+, 
Problem 3 is actually equivalent to the following instance of it. 

4. [JacKecLou02] Translation action o/Q’*'. Let E be the orbit equiv- 
alence relation induced by the translation action 

Q+ X P(Q+) ^ P(Q+) 

(g, A) ^ qA = {qa}a&A- 

Is E hyperfinite? 

Note that, thinking of positive real numbers as Dedekind cuts, M+ C 
P(Q+), so the commensurability relation of Problem 2 is just the restric- 
tion of this E to a subspace. 

2.3 Universal countable Borel equivalence relations 

Definition 2.12. A universal countable Borel equivalence relation is a 
countable Borel equivalence relation E^o such that, for all countable 
Borel equivalence relations E, the inequality E <b E^o holds. 

Theorem 2.13. There is a universal countable Borel equivalence rela- 
tion. [DouJacKec94] 

Examples. 1. [DouJacKec94] The universal equivalence relation usually 
denoted by E^o is defined as the orbit equivalence relation induced by the 
free group E 2 on two generators on its power set V{E 2 ) by translation: 
for g e F 2 , A e V{E 2 ),gA = {gajaeA- 
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2. [And2CamHjo01] For n > 2 let =” be the equivalence relation on 
the Cantor space defined by letting x,y G be equivalent if there is 
a recursive permutation (p such that xip = y. So this equivalence relation 
is induced by a right action on of the group of recursive permutations. 
For n > 5, =” is a universal countable Borel equivalence relation. 

3. [And2CamHjo01] If G is a countable group containing a copy of 
F 2 , then conjugacy equivalence relation on subgroups of G is a universal 
countable Borel equivalence relation. 

4. [Cam02] The equivalence relations of recursive isomorphism on 
countable trees, groups, Boolean algebras, fields, total orderings are all 
universal countable Borel equivalence relations. This is the countable 
analogue of results of S'oo-universality which will be discussed later. 



Open problems 1. [JacKecLou02] Universality of free actions. Are there 
a countable group G and a free Borel action of G on some standard Borel 
space inducing a universal countable Borel equivalence relation? 

Note that the naive attempt to consider the free part of the translation 
action of F 2 on V{F 2 ) is fruitless here, since equivalence relations in- 
duced by free Borel actions of F 2 are treeable, a much simpler class than 
the universal ones. 

2. [DouKecOO] Turing degrees. Is Turing equivalence =t a universal 
countable Borel equivalence relation? 

A positive answer here would imply the failure of a conjecture of 
Martin’s (see the appendix in [KecMosiVS]). 

3. (Hjorth) Upward universality. Let E, F be countable Borel equiv- 
alence relations. Suppose E C F and E be universal. Is F universal? 

This is another interesting structural problem. A positive answer here, 
together with Example 2, would solve Problem 2 in the affirmative. Note 
however that, by results of Adams, for E, F countable Borel equivalence 
relations, E C F does not imply E <b F. 

4. [DouKecOO] Recursive permutations of binary sequences. Recall 
the notation from Example 2. Is =1 universal? 

It would be surprising if the number 5, which is the least n for which 
=” has been proved to be universal so far, had an essential role. 
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2.4 The structure of countable Borel equivalence relations 

The relation <b on countable Borel equivalence relations was linear up 
to Eq, as depicted in (2). However the general structure is very different 
as shown by the following theorem of [AdaKecOO]. 

Theorem 2.14. There is a map A i— > Ea assigning to each Borel subset 
A C M a countable Borel equivalence relation Ea such that, for all Borel 

A,B CR, 

A ^ B Ea Eb- 



Open problems 1. Local structure. Theorem 2.14 says that there is no 
nice general theory of the structure of <b on countable Borel equiva- 
lence relations. However it would be interesting to investigate some local 
behaviour of this partial order, like points with immediate successors or 
predecessors, the structure of cones Ce = {E \ E <b E}, maximal sets 
V of pairwise incomparable elements etc. 

2. Natural examples. The equivalence relations Ea produced in The- 
orem 2.14 are built with ad hoc contructions. On the other hand it would 
be nice to find more examples of natural countable Borel equivalence 
relations pairwise non Borel bireducible. 

3. [JacKecLou02] Essential countability. Let E be an essentially 
countable Borel equivalence relation. Is there a countable Borel equiv- 
alence relation E such that E ~b FI 

3 Classification by countable structures 

An equivalence relation on a standard Borel space is classifiable by count- 
able structures if it is Borel reducible to the relation of isomorphism on 
some Borel class of countable structures. Such a relation is Soo-universal 
if any equivalence relation induced by a Borel action of a closed sub- 
group of S'oo Borel reduces to it (recall that isomorphism is in fact in- 
duced by a continuous action of S'oo). 

Theorem 3.1 (Becker, Kechris). There are Soo-universal equivalence 
relations. [BecKec96] 
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Examples. 1. [MekSl], [FrioStai89], [CamGaoOl] The relations of iso- 
morphism on countable groups, trees, total orderings, fields. Boolean al- 
gebras are S'oo-universal equivalence relations. 

2. [GaoKec03] Isometry on 0-dimensional locally compact Polish 
metric spaces and on ultrametric Polish spaces are S'oo-universal equiva- 
lence relations. 

Open problems 1. [FrioStai89] Cofinality of trees. Let E be classifiable 
by countable structures and suppose that, for each a G cui, isomorphism 
of well founded trees of rank < a is Borel reducible to i?. Is i? an S'oo- 
universal equivalence relation? 

2. [Cam02] Isomorphism and recursive isomorphism. In connection 
with Example 4 from Section 2.3 and Example 1 above, let C be a Borel 
class of countable structures and suppose isomorphism on C is S'oo-univer- 
sal. Is recursive isomorphism on C a universal countable Borel equiva- 
lence relation? 

3. Natural examples. Eind other interesting classes of countable struc- 
tures whose isomorphism relation is S'oo-universal. Similarly, find classes 
of Polish spaces such that the relations of homeomorphism or isometry 
are S'oo-universal. 
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Abstract. In combinatorial optimization one typically wants to minimize some 
cost function given a set of constraints that must be satisfied. A classical exam- 
ple is the so called Traveling Salesman Problem (TSP), in which we are given 
a set of cities with certain distances between them and we are asked to find the 
shortest possible tour that visits every city exactly once and then returns to the 
starting point. This problem is believed to be hard to solve exactly — it has been 
shown to belong to a certain class of notoriously hard problems, the so called 
NP-hard problems. Despite huge efforts, there is no currently known efficient 
algorithm that solves any of the NP-hard problems and it is widely believed that 
no such algorithm exists. However, it is always possible to find an approximation 
to the optimum tour by solving a much easier problem, the so called Minimum 
Spanning Tree problem. This is an example of a general paradigm in the field of 
approximation algorithms for optimization problems: Instead of solving a very 
hard problem we solve an easy one and then convert the optimal solution to the 
easy problem into an approximately optimal solution to the hard one. Obviously, 
it is important to be careful when converting the optimal solution to the easy 
problem into an approximately optimal solution to the hard one. In many appli- 
cations, the analysis of the algorithm is greatly simplified by the use of random 
selections. Typically, we then use information obtained from the solution to the 
easy problem to bias our selection of a solution to the hard one. 
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1 Introduction 

A profound question in theoretical computer science is to determine how 
hard it is to solve certain given problems. The definition of “hard” in this 
context varies from application to application, but a common definition 
is to estimate the amount of time needed to solve the problem. It has been 
widely accepted that a running time that can be bounded by a function 
that is a polynomial in the input length is a robust definition of “reason- 
able running time”; the class of all problems that can be solved in poly- 
nomial time is denoted by P. However, there also exists a large class of 
decision problems, i.e., problems that either accept or reject a given input, 
with the property that an affirmative answer can be verified in polynomial 
time with the aid of a proof; this class is denoted by NP. Consider the 
following problem: Given a Boolean formula on n variables, accept if 
it is satisfiable, i.e., if there exists any truth assignment to the variables 
such that the formula evaluates to True. Clearly, a satisfying assignment 
to the variables is a proof of the fact that a formula is satisfiable. Such a 
proof can be verified in polynomial time by direct substitution in the for- 
mula. It is, however, not known how to find a proof in polynomial time. 
We can solve the above problem by trying all possible proofs but that 
takes time exponential in n. The above problem is in fact contained in a 
subset of NP, consisting of the so called NP-complete problems. Those 
problems are equally hard in the sense that if one of them can be solved 
in polynomial time, then all of them can. Since nobody has been able to 
construct a polynomial time algorithm for any NP-complete problem, it 
is widely believed that no such algorithm exists. In his paper [Lev73], 
Levin wrote (although he did not use the term NP-complete): 

[The NP-complete] problems can be solved by the trivial algorithm that consists of 
exhaustive search in the solution space. However, these algorithms require exponential 
time and mathematicians believe that there are no faster algorithms for such problems. 

[. . . ] (Eor instance, it has not yet been shown that finding a mathematical proof requires 
more time than it takes to verify its correctness.) 

This essentially summarizes the state of affairs also today. In fact, what 
is written above is exactly the P = NP question, the question whether 
it takes more effort to find a proof than it does to verify its correctness. 
During the first years after the introduction of the NP-completeness con- 
cept, several problems with wide use in applications were shown to be 
NP-complete (cf. [GarJoh79]). 
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2 Optimization Problems 

While optimization problems strietly speaking eannot belong to NP 
sinee they are not deeision problems, they ean often be shown to be at 
least as hard as some NP-eomplete problem. In that ease they are ealled 
NP-hard and eannot be solved in polynomial time if P NP. However, 

it is often enough to know that a solution is roughly the best possible. For 
an optimization problem known to be NP-hard, the following question is 
therefore a natural one to study: Is it possible to find, in polynomial time, 
a solution elose to the optimum? For a large elass of problems, a eertain 
sehema has been very sueeessful for designing provably good polyno- 
mial time approximation algorithms: First solve an easier optimization 
problem exaetly and then extraet an approximate solution to the harder 
problem from the optimal solution to the easier one. In this paper, we il- 
lustrate this sehema by deseribing approximation algorithms for the two 
NP-hard problems Maximum Satisfiability and Maximum Cut. We then 
eonelude by stating some open problems. 

Definition 2.1. A CHV-clause over Boolean variables is a eolleetion of 
literals, i.e., variables or their negations. A elause is satisfied by an as- 
signment to the variables if the assignment sets at least one of the literals 
in the elause to True. 

For instanee, one example of a CNF-elause over {xi, X2,xfi\ is x\\/ X2\f 
3 ^. The bar over x^, indieates that xs is negated. The elause a; i V 0:2 V 
is satisfied if either x\ or X2 is True or if x^, is False. 

Definition 2.2. Maximum Satisfiability is the following problem: Given 
a eolleetion of Boolean variables, a eolleetion of CNF-elauses over those 
variables, and a eolleetion of positive weights eorresponding to eaeh 
elause respeetively, find a truth assignment to the Boolean variables that 
maximizes the total weight of the elauses eontaining at least one true 
literal. 

For instanee, the following eolleetion of elauses define an instanee of 
Maximum Satisfiability over {xi,X2, x^, X4} (let us give the same weight 
to all elauses for the sake of simplieity): 



Xi V 0:2 V X3, a; 1 V 0:2 V 0:3, X\ V X4 

Xi V T2 V T3, V T2 V Xa, Xa V T4. 
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Fig. 1. The graph shown above is an example of an instance and a feasible solution for the Maxi- 
mum Cut problem. If the nodes are partitioned into black and white nodes as above, all but six of 
the edges connect a white vertex with a black one. 



Definition 2.3. Maximum Cut is the following problem: Given a graph 
and a collection of positive weights corresponding to each edge respec- 
tively, find a partition of the vertex set of the graph that maximizes the 
total weight of the edges whose endpoints belong to different parts in the 
partition. 

Definition 2.4. A randomized c-approximation algorithm for a maxi- 
mization problem is a polynomial time algorithm that for any instance 
outputs a solution with expected weight at least c times the value of the 
optimum solution. Such an algorithm has expected performance ratio c. 

3 A Naive Probabilistic Algorithm 

Let us study the following heuristic for Maximum Satisfiability over 
the Boolean variables {xi, T 2 , . . . , For each variable Xi in the in- 
stance independently set to True with probability 1/2 and False with 
probability 1/2. For each clause Cj, we introduce the indicator random 
variable Yj for the event that Cj is satisfied. If we denote the weight of 
the clause Cj by Wj we can write the weight of the satisfied clauses as 
W = WjYj if there are m clauses. Once we have used two im- 
portant facts, it is rather straightforward to compute E[fF], the expected 
weight of the satisfied clauses. Firstly, expectation is a linear operator. 
Secondly, the expected value of an indicator random variable is the prob- 
ability that the variable is 1. Thus, 

m m 

EIU-] = «My,] = E PhL = i|- 

i=i i=i 
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Notice that this means that we can compute Pr[l^ = 1] separately, re- 
gardless of the dependence between different Yj, which vastly simplifies 
the analysis. The key observation now is that for any fixed clause there is 
only one assignment that does not satisfy the clause: The assignment that 
sets all literals in the clause to False. A literal can be either a variable 
or a negated variable, but in both cases the probability that the literal 
is False is 1/2. Since we assign values to the variables independently, 
the probability that all literals in the clause Cj are false is where 

\Cj\ denotes the number of literals in the clause Cj. Thus, the clause is 
satisfied with probability 1 — and we can write the expected weight 
of satisfied clauses as 



E[1T] =^Wj(l-2-l^^'l). (1) 

i=i 

To prove an expected performance ratio, we must somehow relate the 
above expression to the optimum value. A very crude upper bound on the 
objective value is since the optimum value is always at most 

the total weight of the instance. Similarly, a very crude lower bound on 
the factor 1 — 2“!'"^ I is 1/2, corresponding to \Cj\ = 1, which implies 
that E[kF] > ^ '^j- We have now proved the following theorem: 

Theorem 3.1. The algorithm described in Section 3 is a randomized 
approximation algorithm for Maximum Satisfiability. 

The above algorithm is usually attributed to Johnson. For the case when 
all clauses have the same weight, a derandomized version of the algo- 
rithm, i.e., a version where the random selections have been replaced by 
deterministic ones in such a way that the quality of the solution does not 
decrease, is exactly what Johnson calls Algorithm B2 in his paper [Joh74], 
although he does not state the algorithm in terms of a probabilistic con- 
struction. 

4 The Plan: Relaxations and Randomized Rounding 

The algorithm from Section 3 seems to be a rather naive heuristic since 
it always selects truth values according to the same distribution, regard- 
less of the instance. It seems reasonable that it would help to skew the 
distribution towards the optimal assignment. Moreover, we very crudely 
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bounded the optimal value by the total number of clauses in the instance 
when we analyzed the algorithm. When the instance is almost satisfiable, 
this bound is quite good, but for instances were the optimum value is 
far from the number of clauses, we need a better upper bound. At first, 
it seems hard to achieve any of the two improvements suggested above, 
since it is NP-hard to compute the true optimum value. However, it turns 
out that it is in many cases fruitful to use the concept of relaxations from 
combinatorial optimization [Lue73]. 

Definition 4.1. Let P be an optimization problem of the form “maxi- 
mize $(?/) subject to y G C”. A problem P' , of the form “maximize 
subject to y G C"”, is a relaxation of P if C C C" and ^'{y) > 
^{y) for all y E C. 

A relaxation alone provides us with an upper bound on the optimum 
value. But we are usually interested in more than this; we also want to 
obtain a feasible solution with weight close to the optimum value. For 
instance, in an approximation algorithm for Maximum Satisfiability, just 
knowing that it is possible to satisfy many clauses is not as interesting 
as obtaining an assignment to the variables satisfying many clauses. The 
potential problem of solving a relaxed version of the problem is that we 
usually end up with a solution that is not feasible for the original problem. 
To convert the optimal solution to the relaxed problem into a feasible so- 
lution to the original problem, a procedure called randomized rounding 
is often used. Typically, the combinatorial optimization problem is first 
expressed as an integer program, i.e., a program of the form “maximize 
some given function of integer- valued variables subject to a given collec- 
tion of constraints of those variables”. This program is NP-hard to solve 
optimally, but it is possible to relax the program into something that is 
solvable in polynomial time. Then the solution to the relaxed problem 
is used to obtain a probability distribution according to which a feasible 
solution to the original problem is selected. 

To give a bound on the performance, we use a two-step argument: 
Firstly, we know that the optimum value of the relaxation of a maximiza- 
tion problem is an upper bound on the optimum value of the original 
problem. Secondly, we prove that the solution selected in the random 
process has an expected weight at most some factor c from the optimum 
value of the relaxed problem. Consequently, the solution selected has an 
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expected weight at most a factor c from the optimum weight of the orig- 
inal problem. 

We exemplify the above procedure below by describing a linear re- 
laxation of Maximum Satisfiability [GoeWili94] and a semidefinite re- 
laxation of Maximum Cut [GoeWili95]. 

5 Relaxations via Linear Programming 

Definition 5.1. A linear program in standard form is written 
minjc^a; : a; > 0 A Vi = 1, . . . , m{ajx = 6j)|, 

where ai, . . . , a^, c, and x are n-dimensional real vectors and bi,. . . ,bm 
are real scalars. 

It is possible to express inequality constraints by using additional vari- 
ables. The constraint OiX > bi can be rewritten as OiX — Zi = bi together 
with Zi > 0, where Zi is a new scalar variable. 

Finally, we remark that a linear program in n variables can be solved 
optimally in time 0{n^L), where L is the number of bits needed to store 
the program [Ye91]. 

5.1 A Linear Relaxation of Maximum Satisfiability 

We start by formulating Maximum Satisfiability as an integer program. 
For for each variable Xi we introduce an indicator variable y* for the 
event that Xi is True and for each clause Cj we introduce an indicator 
variable Zj for the event that Cj is satisfied. We also introduce for each 
clause Cj the sets Tj, containing the variables appearing unnegated in Cj, 
and Fj, containing the variables appearing negated in Cj. Finally we de- 
note the weight of the clause Cj by Wj. Then the optimum of the Max- 
imum Satisfiability instance is given by the optimum of the following 
integer program: 

m 

maximize WjZj 
i=i 

subject to X] f/i + - Vi) > Zj 

i&Tj i&Fj 

Hi e {0, 1} for all i = 1, . . . , n, 

Zj e {0, 1} for all j = 1, . . . , m. 



(2) 




84 



LARS ENGEBRETSEN 



To see this, remember that the value of Hi deeides the truth assignment 
to the eorresponding Xi. Then notiee that Zj = 0 only when = 0 for 
all i E Ti and yi = I for all i e Fi, i.e., when all variables appearing 
unnegated in Cj are False and all variables appearing negated in Cj are 
True. As soon as any of the unnegated variables are True or any of the 
negated variables are False, 

+ > 1 , 
i&Tj i&Fj 



and Zj assumes the value 1, since all weights are positive. 

Since it is NP-hard to solve the above integer program, we relax the 
constraints on and Zj to obtain the following linear program: 



m 



maximize 



E 



WjZj 



i=i 

subject to X] l/i + - yi) 

i&Tj i&Fj 

0 < l/i < 1 for all i = 
0 < Zj <1 for all j = 






(3) 



To see why this is a relaxation, note that the set of feasible solutions to 
this program contains all solutions feasible to the integer program. For 
those solutions the objective functions are identical in the two programs. 
We denote the optimum solution to the relaxed problem above by y* 
and z*. There is a y* for each variable Xi and a z* for each clause Cj and 
the optimum value of the relaxed problem is 



m 

Wl = 

i=i 

Similarly, we denote the optimum value of the integer program by wj. 
Note that although we do not know the optimum to the integer program, 
we do know that wl > wj, by the definition of a relaxation. 
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5.2 Rounding with Independent Random Assignments 

Given an optimal solution to the relaxed program, we select a truth as- 
signment to the variables xi as follows: Independently for each i, set 



Xi = 



True 

False 



with probability y*, 
with probability 1 — y*. 



(4) 



Let us now analyze how many clauses this assignment satisfies on aver- 
age. Introduce an indicator random variable 



1 if Cj is satisfied, 
0 otherwise. 



(5) 



Then we can use these random variables to compute the weight of our 
randomly selected solution as 



m 

i=i 

As in Section 3, we can express E[W] as 

m m 

E|W'l = 11- 

i=i j=i 

What remains is to bound Pr[Zj = 1]. We do this by bounding Pr[Zj = 
0] . The only way the clause is left unsatisfied is if all literals in the clause 
are false. Since the variables are assigned truth values independently, 

PrlZj = 0| = n<l-!'.')n!'.*- 

i&Tj i&Fj 



By the arithmetic/geometric mean inequality, 

Oi -f 02 -f ■ ■ ■ -f Ofc 

k 

for non-negative real numbers oi, 02 , ... , a^. Since 0 < y* < 1, we can 
use the arithmetic/geometric mean inequality to bound Pr[Zj = 0] by 




E*6T,(i-i/r) + E 



zeFj Vi 



\Cj\ 



\c, 



Ei[Zj = 0] < 
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If we rearrange the terms in the sums, we obtain 



Pr[Z, =0]< 1- 



Vi + Vi) 



m 



Sinee XlieT Vi + ^ bound on Zj from the linear 

program (3), 

Pr[Z, = 1] = 1 - Pr[Z, = 0] > 1 - (l - . 

This expression ean be bounded from below by l3\Cj\z*, where Pk = 
1 — (1 — To see this, it is enough to notiee that the eontinuously 
differentiable funetion 

f{z) = 1 - (^1 - I) - PkZ 

is zero when G {0, 1} and that its seeond derivative 



riz) 



k-l/ 
k V ~ k) 



is negative when 2 : G [0, 1]. This implies that / is eoneave on [0,1], whieh 
in turn implies that f{z) > (1 — z)f{0) + zf{l) = 0 on that interval. 
Therefore, 

m 

E[W]>J2P\c,n^l ( 6 ) 

j=l 

Sinee < 1 — 1/e, we ean reformulate this as 



E[1P] > ( 1 - - 
e 



m 

i=i 





Wl- 



Finally, we use the faet that wl is the optimum value of a relaxation 
of Maximum Satisfiability. This means that wl > wj, whieh together 
with the above bound shows that E[W] > {1 — l/e)wi, i.e., that the 
above algorithm is a randomized (1 — 1/e) -approximation algorithm for 
Maximum Satisfiability. 
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5.3 A Probabilistic Combination of Two Algorithms 

To obtain an approximation algorithm for Maximum Satisfiability with 
expected performance ratio better than 1 — 1/e we need to use a re- 
finement of the above technique. We notice that the naive algorithm 
from Section 3 actually has good performance when there are only long 
clauses in the instance, while the above algorithm has good performance 
when there are only short clauses in the instance. Let us now combine 
these two algorithms as follows: For a given instance, we run both algo- 
rithms. This gives us two solution candidates. As our solution, we select 
the candidate giving the largest weight. By the expression (1) and the 
bound (6), the expected weight of the solution produced by this algo- 
rithm is 

{ m m 

i=i i=i 



To bound this quantity, we note that max{ai, 02 } > (oi -f a2)/2, which 
enables us to write 



EIH-] > 



U14 






i=i 



sE 



WjZj 



l_2-l^d+/5|c, 



where the last inequality follows since 0 < 2 :* < 1. It is therefore enough 
to prove that 1 — 2“^ + /9fc > 3/2 for each k to obtain a | -approximation 
algorithm. This bound holds with equality when k = 1 or k = 2, and 
1 — 2“^ > 2 — 1/8 — 1/e > 3/2 when k > 3. Thus, we have proved 

the following theorem: 

Theorem 5.2. The algorithm described in Section 5 is a randomized |- 
approximation algorithm for Maximum Satisfiability. [GoeWili94] 

The first randomized | -approximation algorithm for Maximum Satisfia- 
bility was constructed by Yannakakis [Yan94], and the above algorithm 
was constructed after the algorithm of Yannakakis was first presented. 



6 Relaxations via Semidefinite Programming 

Definition 6.1. A square matrix A is called positive semidefinite (PSD) 
if X ■ Ax > 0, where ■ denotes the inner product, for all x gMA. 
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There are many equivalent definitions of a semidefinite program in the 
literature. To state one, we need to define the inner product between ma- 
trices, A*B = CLijhj- 

Definition 6.2. A semidefinite program on v? variables is a program of 
the form minx{C • X : X is PSD A Vi = 1, . . . , m{Ai • X = bi)}, 
where Ai, , A^, C, and X are real n x n-matrices and 6i, . . . , 6^ are 
real scalars. 

Given that certain technical requirements -which we will not delve into 
here- are satisfied, a semidefinite program on n variables can be solved 
within an additive error of e in time polynomial in n and log ^ [Ali95]. 

Lemma 6.3. Every linear program is also a semidefinite program. 

Proof. Suppose that the linear program is of standard form as in Defini- 
tion 5.1. We can then transform it into a semidefinite program as follows: 
For a vector v let diag(L) denote the matrix that has the entries of v on the 
diagonal and zeros everywhere else. Set C = diag(c), and Ai = diag(aj) 
for i = 1, . . . ,m. Add constraints ensuring that X is diagonal to the 
semidefinite program. q.e.d. 

If A is positive semidefinite, there exists an n x n-matrix B such that 
A = BB^ . This matrix can be computed in time 0{rA) by incomplete 
Cholesky factorization [GolsvLoSS]. This computation is needed in the 
algorithms below to compute from an n x n matrix A = (au') vectors 
{vi, . . . , Vn} such that an' = Vi-vn. We can use the above fact to rephrase 
the definition of a semidefinite program in a way suitable for our appli- 
cations below. 

Definition 6.4. A semidefinite program on n vector-valued variables is 
a program of the form “maximize T(li, . . . , Vn) subject to \vi\^ = 1 for 
all i = 1, . . . , n and Cfivi, . . . ,Vn) > 0 for all j = 1, . . . , r” where the 
functions L and Cj are linear in the inner products ■ v^. 

6.1 A Semidefinite Relaxation of Maximum Cut 

Goemans and Williamson [GoeWili95] construct an approximation algo- 
rithm for Maximum Cut by studying a relaxation of an integer quadratic 
program. We will present their construction here as an example of a 
semidefinite relaxation combined with randomized rounding. For a graph 
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G = (y,E) with vertices V = {l,...,n},we introduce for each vertex i 
in the graph a variable y* G { — 1, 1}. If we denote by ww the weight of 
the edge (i, i'), the weight of the maximum cut in the graph is given by 
the optimum of the integer quadratic program 

E l — HiHi' 

Wii/ 

(i,i')eE ^ (V) 

subject to Hi e { — 1, 1} for all i. 

The partition (Vi, V 2 ) yielding the optimum cut can be formed as Vi = 
{i:Vi = 1} and V 2 = {i : Vi = -1}. 

It is NP-hard to solve (7) optimally. To obtain a polynomial time 
approximation algorithm, Goemans and Williamson [GoeWili95] con- 
struct a relaxation of the above integer quadratic program by using n- 
dimensional real vectors Vi instead of the integer variables yi. The prod- 
ucts yiyii are then replaced by the inner products Vi ■ Vii. This gives the 
semidefinite program 

E l — 

Wii' 

^ ( 8 ) 

subject to I Lip = 1 for all i. 

To see that this is a relaxation of the integer quadratic program above, 
we note that we can transform a feasible solution to the integer quadratic 
program into a feasible solution to the semidefinite program as follows: 
Let the first coordinate of Vi in the semidefinite program be the value 
of yi in the integer quadratic program; all other components of Vi are 
set to 0. Then the feasible solutions to the integer quadratic program are 
feasible also for the semidefinite relaxation, and the objective functions 
are identical for solutions feasible to the integer quadratic program. 

We denote the optimum solution to the relaxed problem above by v*, 
and the corresponding optimum value by 

* 

- 

Wii' . 

{i,i')£E 

Similarly, we denote the optimum value of the integer quadratic program 
by wq. Although we do not know how to compute it in polynomial time, 
we do know that ws > wq by the definition of a relaxation. 
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Fig. 2. When the vector r is in the shaded region, (v* ■ r)(v*i • r) < 0. If r is uniformly distributed 
in the plane, the prohahility that r is in the shaded region is 2t?/27r = 



6.2 Rounding with a Random Hyperplane 

To obtain a partition of the graph from the solution to the semidefinite 
relaxation, we select a random vector r, uniformly distributed on the unit 
sphere in M”, and set Vi = {i \ v* ■ r > 0} and V 2 = {i '■ v* ■ r < 0}. 
Vectors satisfying n* ■ r = 0 can be assigned a part in the partition arbi- 
trarily since they occur with probability zero. As before, we can write the 
expected weight of the solution produced by the randomized rounding as 

E[1V]= Wu'FT[{v*-r){v*,-r)<0], 



and we now try to bound the separation probability. Notice that the angle 
between v* and the projection of r into the plane spanned by v* and v*, 
is uniformly distributed, and thus 



Pr[(n* ■ r){v*, ■ r) < 0] 



arccos v* ■ v*, 

7T 



We want to bound this from below by some constant factor a times 
(1 — n* ■ v*,)/2 since then we can write 



E[W] >a 



I - V* ■ V*, 



Wjjf - 



which would show that the algorithm has expected performance ratio a. 
We compute a lower bound on a by considering all possible configura- 
tions of the vectors v* and v*,, i.e., we use the bound 

. 2 arccos v* ■ v*, 2 . § 

a = mm — ^ = — mm . 

7t(1 — V* ■ V*,) 7T i?e[0,7r] 1 — COS 17 
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Consider the function 



a(i9) 



2i9 

7t( 1 — cosi9) 



(9) 



which describes the approximation factor for an edge in the instance as 
a function of the angle between the vectors corresponding to the end- 
points of the edge. It is continuously differentiable in (0,7t], a(7r) = 1 
and lim^^o+a(i^) = oo; therefore a = min.^g(o,,r) Any local ex- 
treme point in (0,7t) satisfies a'(i9) = 0. This equation can be solved 
numerically to obtain that a ~ 0.87856. It is also possible to prove that 
a > 0.87856, which shows that 



E[W] > 0.87856 ^ wu >- — > 0.87856ws > 0.87856 wq. 



To conclude, we have now proved the following: 

Theorem 6.5. The algorithm described in Section 6 is a randomized 
0.87856-approximation algorithm for Maximum Cut. [GoeWili95] 



7 Directions for Future Research 

We have presented algorithms for the Maximum Satisfiability and Max- 
imum Cut problems using a general framework: relaxations combined 
with randomized rounding. We now conclude by stating some open prob- 
lems related to the results presented above. 



7.1 Purely Combinatorial Algorithms 

Relaxations and randomized rounding give us a framework for construct- 
ing approximation algorithms. However, it is not clear that they help us 
understand the structure of the problem. Maybe we could get a better 
understanding of why certain problems are more difficult than others 
by constructing purely combinatorial approximation algorithms with the 
same performance ratio as the currently best known algorithms. 
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1.1 The Approximability of Maximum Satisfiability 

The best known algorithm for Maximum Satisfiability that is based on 
linear programming and randomized rounding is the |- approximation al- 
gorithm presented in Section 5. Is there a better one, or can we prove 
that 3/4 is the best possible ratio such an algorithm can achieve, un- 
less P = NP? As a comparison, the currently best known algorithm, 
which uses a mixture of several different techniques, has expected perfor- 
mance ratio 0.7846 {cf. [AsaWiliOO]) and there is an algorithm based on 
semidefinite programming and randomized rounding with an expected 
performance ratio of 7/8 for Maximum 3-Satisfiability, z.e.. Maximum 
Satisfiability restricted to instances where the clauses have at most three 
variables {cf. [KariZwi97]). 

7.3 The Approximability of Maximum Cut 

The currently best known algorithm for Maximum Cut is in fact the 
0.878-approximation algorithm presented in Section 6. It is known that 
the analysis of the algorithm is tight {cf. [Kari99], [FeioSchoOl]) and that 
it is NP-hard to approximate Maximum Cut within 16/17 ~ 0.942 {cf. 
[HasOl]). Can we close this gap, i.e., either provide a better approxima- 
tion algorithm or a better approximation hardness result? 

7.4 Special Cases 

Another promising direction for future research is to study special classes 
of input instances. One such class is defined by so called “almost satis- 
fiable” instances, which for Maximum Cut amounts to instances that are 
almost bipartite, i. e . , where it is possible to cut a fraction 1 — e of the edges 
for some small e. By studying the function a{{f) defined in equation (9) 
when -d is close to 1 it is possible to see that the algorithm presented 
in Section 6 cuts a fraction 1 — of the edges in such instances. 

Zwick in [Zwi98b] shows similar results for Maximum 2-Satisfiability 
and Maximum Horn Satisfiability. 

7.5 Optimal Approximation Hardness Results 

For some optimization problem, just guessing an assignment uniformly 
at random works gives an approximation algorithm. For instance, this 
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simple algorithm gives expected performance ratios of 1/2 for Maxi- 
mum Cut and 1 — 2“^ for Maximum E/c-Satisfiability — ^Maximum Sat- 
isfiability where clauses contain exactly k literals. For Maximum Cut 
and Maximum E2-Satisfiability, respectively, there are better polyno- 
mial time approximation algorithms, with expected performance ratios 
of 0.878 and 0.931, respectively [GoeWili95], [FeioGoe95]. However, 
it is NP-hard to approximate Max E/c-Satisfiability when k > 3 within 
1 — 2“^ -l-e for any constants > 0 [HasOl]; we call such problems approx- 
imation resistant. Which problems are approximation resistant? For con- 
straints on two or three Boolean variables, this problem has been solved: 
It is known that none of the constraints on two Boolean variables is ap- 
proximation resistant -this is a corollary of the Goemans-Williamson 
algorithm {cf. [GoeWili95])- and that the approximation resistant con- 
straints on three Boolean variables are exactly those implied by parity 
{cf. [HasOl], [Zwi98a]). For constraints on two variables over some finite 
domain, the situation is not completely resolved. There are algorithms es- 
tablishing that many such constraints are not approximation resistant {cf. 
[AndoEngHasOl], [EngGur02], [Fri2Jer97], [GoeWiliOl]). In fact, the 
only natural constraint on two non-Boolean variables whose status with 
respect to approximation resistance has not yet been resolved is the gen- 
eralization of E2-Satisfiability, i.e., the constraint {x\ ci) V {x 2 f C 2 ). 
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0 Introduction and outline of the paper 



The technique of forcing was invented by Paul Cohen {cf. [Coh63]) in or- 
der to produce a model of the commonly accepted system of set-theoretic 
axioms, ZFC, the Zermelo-Fraenkel axioms ZF together with the axiom 
of choice, in which the Continuum Hypothesis (CH) fails. The existence 
of such a model shows that CFI does not follow from ZFC. Earlier it was 
shown by Gddel {cf. [KunSO, Chapter VI]) that CFI cannot be refuted 
from ZFC, assuming of course that ZFC is consistent. 

Forcing turned out to be a very powerful tool for proving indepen- 
dence results in mathematics. Starting from a model V, the ground model, 
which satisfies ZFC, a partial order P G V is fixed which codes the de- 
sired properties of the model one wishes to construct. Then forcing with 
P over V yields a generic extension V[G] where all this information is 
decoded and ZFC holds as well. V[G] is obtained by adding a certain 
generic object G (a subset of P) to V. The model V[G] is the smallest 
model of ZFC which extends V and contains G. 

Sacks forcing is one such partial order and was invented by Gerald 
Sacks to produce a minimal forcing extension V[G] in the following 
sense: If FE is a model of ZFC such that V C W C V[G], then ei- 
ther W = V or W = V[G]. In this article we give an overview of the 
properties and applications of Sacks forcing. 

In Section I we illustrate the idea of forcing by an example. The 
reader who is interested in learning the details of forcing is referred to 
[KunSO], [JecVS], or [Jec02]. Our notation follows [KunSO] and when- 
ever we state a fact about forcing without mentioning a source, this fact 
can be found in [KunSO] or, if Borel sets and descriptive set theory are 
involved, in [JecVS]. This section is ment as a sketchy introduction to 
forcing, and it certainly does not replace some detailed treatment as in 
the books mentioned above. 

After that, in Section 2, we introduce Sacks forcing and study some 
of its properties. We isolate the so-called Sacks property and mention 
some of its consequences in Section 3. Section 4 is devoted to relatives of 
Sacks forcing which share some of its important features. Sacks forcing 
does not satisfy the countable chain condition (c.c.c.). So in Section 5 we 
wonder whether there is any c.c.c. forcing that has the Sacks property. 
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We then turn to iterations of Saeks foreing. Seetion 6 deals with 
eountable support side-by-side produets of Saeks foreing and in Seetion 7 
we diseuss the eountable support iteration of Saeks foreing of length UO 2 . 

1 Collapsing the continuum to 

When Godel proved that CH is eonsistent with ZFC, he started from a 
model V of ZFC and eonstrueted a definable elass L, the elass of eon- 
struetible sets, inside V sueh that L satisfies ZFC-fCFI. (Aetually, Godel 
started from a model of ZF and produced a model of ZFC-fCFI.) Forcing 
works the opposite way. We start from a model V of ZFC and extend it. 
As usual, we will frequently talk about models of set theory, which just 
means models of ZFC. Whenever we consider two models Vq, of set 
theory such that Vq C Vi, we will tacitly assume that Vq is transitive in 
Vi, i.e., for all x e Vq and y e Vi, if y e x in Vi, then y G Vq. This guar- 
antees that Vq and Vi agree on elementary properties of sets, for instance 
whether a set t G Vq is an ordinal or not.^ 

Suppose we wish to construct a model of ZFC + CH by forcing. Let 
L be a model of ZFC. If V satisfies CH, we are done. Otherwise, in V 
the cardinal the size of M and of p{cu), is bigger than Ki. In this case 
we try to add a function / ; Ki — p{u) which is onto. Since V does not 
satisfy CH, no such function exists in V. However, for every countable 
ordinal a, f\a could be an element of V. We define a. forcing notion 
P, i.e., a partial order, which consists of possible initial segments of a 
surjective map / ; Ki — p{u). 

Let P ;= {p : ran(p) C & 3o; < o;i(dom(p) = a)}. The order 
on P is reverse inclusion. More precisely, for p, g G P let p < g iff g C p, 
i. e.,ifp extends g. If p < g, we say that p is stronger than g. The elements 
of P are the forcing conditions or just conditions. Two conditions po, pi G 
P are compatible iff there is g G P such that g < po,Pi- Otherwise po 
and Pi are incompatible. In this case we write po -L pi. Our particular 
forcing notion P is a tree of height (the root is the empty function, 
and stronger (i.e., smaller with respect to <) conditions are higher up the 
tree). In general, a forcing notion can be any partial order. For technical 
reasons, we assume that every partial order Q has a largest element 1q. 



Cf. [KunSO, IV.3] for more information on this subject. 



1 
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We now describe how we get a surjection / ; from P. A 

subset G C P is di filter iff 

- for all p,q G G there is r G G such that r < p,q and 

- for all p G G and all g G P with p < q, q G G. 

In our case, a filter in P is the same as a branch in the tree P. If G C P 
is a filter such that {a < ui : ran(p) C p(u) & 3p G G(dom(p) = 
a)} = Ui, then fc '■= IJ G is a function from Ki to p(cij). However, if 
G is an element of V, then fc is not onto p{u) since 2^° > Ki in V. 
This is where genericity comes into play. We have to make sure that G is 
sufficiently complicated. D C P is dense in P if for all p G P there is q in 
D such that q < p. A filter G C P is F-generic over V if it has nonempty 
intersection with every D eV which is dense in P. A set D C P is dense 
below a condition p G P if for all g G P with g < p there is q' G D with 
g' < g. It is not difficult to check that if D is dense below p and G is 
P-generic over V with p E G, then D f] G ^ 0. 

Except for trivial cases, filters that are generic over V cannot be ele- 
ments of V. Thus, we need someone outside V who chooses a P-generic 
filter over V. It can be shown that for each partial order Q in 1/ it is safe 
(i.e., it does not lead to a contradiction unless ZFC itself is contradictory) 
to assume that a Q-generic filter over V exists. More precisely, whenever 
we consider a specific forcing notion Q it will have a reasonable descrip- 
tion. If the existence of a forcing notion with this description is consistent 
with ZFC, then there is a model V of set theory such that in V we have 
a partial order Q' satisfying the description of Q and there is a model W 
of set theory such that V C W and in W there is a Q'-generic filter G 
over V. 

So, let G be P-generic over V. For every x G p{u) the set : = 
{p G P : a; G ran(p)} is dense in P. Since E V and G is generic, 
there is p G GflDa;, f.e., there is p G G such that x E ran(p). This implies 
that / ;= fc is a function onto p(u). Similarly, for every a < cui, the set 
D°‘ := {p G P : a G dom(p)} is dense in P. It follows that for every 
a < uji there is p G G such that a E dom(p). This implies dom(/) = Ki. 
It follows that / : p(co) is onto. 

After constructing the desired map / from G, we have to explain how 
to get a model V[G] of ZFC from V and G. Roughly speaking, I^[G] 
consists of all sets that are definable from G together with parameters 
from V. More precisely, a class of so-called F-names is defined in V. 
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Then, by means of some simple algorithm, from every P-name x E V 
a set xg is computed using G. The set xq is the evaluation of x with 
respect to G. The model V[G] consists of all xq- Every element y of V 
has a canonical F-name y with the property that for every P-generic G, 
yc = y- In other words, V is included in V[G]. Moreover, there is a name 
G such that for every P-generic filter G, Gq = G. V[G] turns out to be a 
model of ZFC, and it has the same ordinals as V. 

In V[G] we have the function / = fc, as defined above. The function 
/ has at least one name in V, so we can pick one and call it /. Thus, 
/ is the interpretation of the name / under G, and we denote this fact 
by / = fc. We argued that / : ^ p{uj) is onto. However, we were 

talking about the Ki and p{co) of V, not of V[G]. Formally, the symbols 
Ml and p{co) stand for definitions of certain sets. The set p{co) is the 
power set of the first limit ordinal and Ki is the first ordinal a such that 
there is no map from u onto a. As it turns out, u is the same in V as in 
V[G], and this holds for every forcing extension. Let and denote 
the first uncountable ordinal in V or V[G], respectively. Relativizations 
of power sets, i.e., p{uj)'^ and are defined similarly. In order 

to prove that V[G] is a model of CH it is enough to show 
and p{cu)'^ = Both of these statements follow from the fact 

that V[G] does not contain any countable sequences of ordinals that are 
not already elements of V (i.e., there are no new countable sequences of 
ordinals), which we prove in a moment. 

We need some information on how properties of V[G] are connected 
to the properties of P. Let p(xi , . . . , Xn) be a formula in the language 
of set theory (first order logic with the binary relation symbol G) with 
all free variables among x\, ... ,Xn. Then for every condition p G P and 
P-names xi,...,Xn we say that p forces <p(Ti, . . . , T„) iff for every P- 
generic filter G with p E G, V[G] satisfies (p((xi)g, • • • , {xn)G)-^ In this 
case we write p Ihp <p(Ti, . . . , Xn). It is crucial for the theory of forcing 
that the relation Ihp is actually definable in V. Moreover, we have the 
following important fact: 

Theorem 1.1 (The Forcing Lemma). Let Q be a forcing notion and 
suppose that G is Q-generic over V. Then 

y[G] h P{{xi)G,---,{Xn)G) 

^ Here (p(xi, . . . , Xn) denotes ip with every free Xi replaced by the name Xi. 
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{i.e., • • • , (in)c) is true in V^[G]) iff there is g G G sueh that 



q Ihp (p{xi, . . .,Xn). 



Thus, nothing in V[G] depends on ehanee, everything is foreed at 
some point. We say that a eondition p decides a statement 99 if p Ih ip 
or p Ih -i(p. Similarly, if i; is a name, we say that p decides x if there 
is some ground model set x such that p Ih x = x. It is worth noting 
that for conditions p and q with q < p, q forces everything that p forces. 
Moreover, for every condition p and every statement p there is g < p 
such that g decides p. 

Returning to our example, how can we see that there are no new 
countable sequences of ordinals? The forcing notion P is a-closed, that 
is, if {pi)i<uj is a sequence of conditions in P such that pi+i <p pi for 
each i, then there is some p G P such that p < Pi for every i E u. Just let 

Assume there is a P-name h and a condition p which forces that h is 
a function from u to the ordinals. Let p' < p be arbitrary. Then, using 
Theorem 1 . 1 and the remarks following it, we find a decreasing sequence 
of conditions such that go <p p' and for each i there is some 
ordinal a* such that g* Ihp h{i) = ai? 

Let g G P be such that q < qi for all i G cu. Now for all f G cu, g Ih 
h{i) = Gj. In other words, g forces that h is the sequence {ai)i(z^, which 
is an element of the ground model. Since q < p' and p' was arbitrary 
below p, the set of conditions that force /i to be a sequence in the ground 
model is dense below p. It follows that every generic filter G that contains 
p intersects this set. Thus, whenever G is P-generic over V and p G G, 
then he ^ V. It follows that for every P-generic filter G over V, in V[G] 
there are no new countable sequences of ordinals and thus, Ki and p{co) 
are the same in V[G] as in V. But this implies that V[G] is a model of 
ZFC-rCH since in V[G] there is a surjection from Ki onto p{co). 

It is worth mentioning that if p forces /i to be a function from u to the 
ordinals, then we may already assume that every condition g forces h to 
be a function from u to the ordinals. This is because we can replace h by 
a name g such that every condition g forces “g is a function from u to the 

^ Formally, the last part should have been stated as qi Ihp h{i) = 6ti since we have to use 
names to the right of the forcing relation. However, we identify the elements of V with their 
canonical names as long as it is clear what we mean. 




ON SACKS FORCING AND THE SACKS PROPERTY 



101 



ordinals and if p G G, then h = g"\ The existenee of g follows from the 
next theorem, the so-ealled Maximality Principle. 

Theorem 1.2. Let tp{x, pi, . . . , Vn) be a formula in the language of set 
theory. Let Q be a foreing notion, g G Q, and suppose that pi, . . . , are 
Q-names sueh that 

q Ih . . . ,p„). 

Then there is a Q-name x sueh that 

q Ih p{x,yi,...,yn). 

In partieular, if every g G Q forees 3x(p{x, pi, . . . , pn), then 1 q forees 
this and thus, by Theorem 1.2, there is a Q-name x sueh that every g G Q 
forees <p(T,pi, . . . ,p„). 

2 Sacks forcing and its properties 

The foreing notion whieh we introdueed in Seetion 1 for eollapsing 2^° 
to Ki was a rather speeial ease. As mentioned before, foreing was in- 
vented to enlarge 2^°, not to make it smaller. Enlarging 2^° ean be done 
by adding new reals, where a “real” is an element of one of the sets p{uj), 
2“, M, and [0, 1]. In our eontext, usually it does not matter whether 
we look at p{u), 2‘^, M, or [0, 1], and here is why: 

The first two spaees ean be identified in a natural way. The spaee 
is homeomorphie to the set of irrational numbers, a subset of M whose 
eomplement is eountable and therefore neglegible for our purposes. The 
eompaet unit interval is a eontinuous image of 2‘^ via a mapping that fails 
to be injeetive only on eountably many plaees, i.e., again on a neglegi- 
ble set. Finally, [0, 1] is homeomorphie to the two-point eompaetifieation 
of M. These mappings, whieh are almost bijeetions between the differ- 
ent spaees, are all nieely definable and show that onee we know all the 
elements of one of those spaees, we know all the elements of the other 
spaees too. Moreover, if we investigate some frequently studied ideals 
sueh as the ideal of measure zero sets, of meager sets, or of eountable 
sets, it turns out that, as far as the basie properties of these ideals eon- 
eerned, it does not matter on whieh inearnation of “the reals” the respee- 
tive ideals are eonsidered. 
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The forcing notion used by Cohen to add new reals is now called 
Cohen forcing and is defined as follows. Let C be the set of all finite 
sequences of O’s and I’s. The set C is ordered by reverse inclusion. If 
G is C-generic over the ground model, then the Cohen real added by 
C is just IJ ^ function from oj to 2. Thus, Cohen forcing uses finite 
approximations to create a new real. To enlarge the continuum using C 
one has to use iterate Cohen forcing in order to add many new reals. 
Iterated forcing will be discussed later. 

Let us consider another description of Cohen forcing. Whenever we 
consider a Boolean algebra B as a forcing notion, we mean the partial 
order (B\{0 b}, <) where < is the natural order on B. Let (P, <p) and 
(Q, <q) be two partial orders. A mapping e : P — Q is a dense embed- 
ding iff the following conditions hold: 

- Vp,p' e P(p <p p' ^ e{p) <Q e{p')) 

- Wp,p' e P(p ±p p' ^ e{p) Tq e{p')) 

- V, 6 Q3p € PP(p) <0 ,) 

Note that we do not demand e to be 1-1. But condition (ii) implies that 
for all p,p' G P with e{p) = e{p') and every P-generic filter G we 
have p G G iff p' G G. Every partial order P has a so-called comple- 
tion, a complete Boolean algebra B such that P densely embeds into 
(B\{0 b}, <). The Boolean algebra B is unique up to isomorphism. 

We call two partial orders P and forcing equivalent (or just equiva- 
lent) iff they have isomorphic completions. If P and Q are equivalent, 
then they produce the same generic extensions. Therefore, equivalent 
partial orders can be considered the same as far as forcing is concerned. 

Many forcing notions adding generic reals can be described in the fol- 
lowing way: Let Bor(M) denote the Boolean algebra of Borel subsets of 
the reals. If X C Bor(M) is an ideal, then Bor(M) /X is a Boolean algebra. 
As a forcing notion, this Boolean algebra is equivalent to (Bor(M)\X, C). 
Let M. denote the ideal of meager subsets of the real line. Then Cohen 
forcing is equivalent to Bor(M)/Af since Bor(M)/Af has a dense subset 
which is isomorphic to C. 

Another important ideal in Bor(M) is the ideal ff of Borel sets of 
measure zero. The forcing notion Bor(M) /ff is called random real forc- 
ing and adds a random real. Both forcing notions, Cohen and random 
real forcing satisfy the countable chain condition (c.c.c.), that is, they 
only have countable antichains. Here an antichain is a set of pairwise 
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incompatible elements. Foreing notions with the e.e.e. do not eollapse 
eardinals. More exaetly, if P is e.e.e. and G is P-generie over the ground 
model V, then an ordinal a is a eardinal in V[G] iff it is a eardinal in V. 

If X C Bor(M) is a a-ideal, then the generie real added by Bor(M) /X 
is determined as follows: We work with Bor(M)\X instead of Bor(M) /X. 
Let G be Bor(M)\X-generie over the ground model V. Then is 
nonempty and eontains a unique real xg, and this real is the generic real 
added by Bor(M) /X. While xq is not a generie filter for some partial 
order, G ean be reeovered from xq (i-e., G and xq are interdefinable), 
and thus it makes sense to talk about xq as the generie objeet added by 
Bor(M) /X. In a moment we will see how G ean be reeovered from xq- 

To deseribe the properties of xq, we have to mention Borel codes (cf. 
[Jee78, page 537]). A Borel set X CM is not merely a set of real num- 
bers, but it has a deseription of how it ean be obtained from open sets by 
taking eomplements and unions over eountable families. This deseription 
ean be eoded as an element of and this eode is a Borel eode for X. 
(Note that a Borel eode for a given set X is not unique.) 

In faet, usually mathematieians use Borel eodes when they are talking 
about Borel sets. E.g., they are talking about the open set (0, 1) instead of 
thinking about every single real inside this interval. The notation (0,1) 
ean be eonsidered as a Borel eode for a eertain open set. This Borel eode 
has different interpretations in different models of set theory. However, 
usually the strueture of a set matters, and the individual elements whieh 
are inside are unimportant. When we talk about a Borel set X in two 
different models of set theory, we really ehoose a Borel eode for X and 
then identify the two Borel sets that are obtained in the respeetive models 
by applying the definition of a Borel set eoded by the Borel eode. This 
does not depend on the partieular Borel eode we ehose for X. 

The generie real xq has the property that it is not an element of any 
element of X. More preeisely, let X G X. Then X is Borel and thus has 
a Borel eode. (We work in V, so far.) Now the Borel set in V[G] whieh 
has this Borel eode does not eontain xq- Roughly speaking, xq avoids 
all sets from X. 

Now G ean be reeovered from xq as follows: Let (Bor(M))^ de- 
note the set of ground model Borel sets. Then G is the set of all X G 
(Bor(M))'^ whieh in V[G] eontain xq- Sinee xq avoids all elements of 
the ideal X, the ground model Borel sets whieh in V[G] eontain xq are 
indeed elements of (Bor(M))'^\X. By the definition of xq, every ele- 
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merit of G contains xq (in V[G]). On the other hand, if X is a ground 
model Borel set which in V[G] contains xq, then X is an element of 
(Bor(M))'^\X which is compatible with all elements of G since the in- 
tersection of X with an arbitrary element of G is again a ground model 
Borel set that in V[G] contains xq- It is a general fact about generic filters 
that they contain a condition iff the condition is compatible with all the 
elements of the filter. This implies X G G.* 

One (T-ideal of Bor(M) stands out as the smallest a-ideal containing 
all the singletons, namely the ideal ctbl of countable sets. Sacks forcing 
is Bor(M) / ctbl. A Sacks real is a generic real for Sacks forcing. 

There is another ideal related to Sacks forcing, Marczewski’s ideal 
introduced in [Maro35].^ A set X C M is perfect iff it is nonempty, 
closed, and has no isolated points. A set X C M is in the ideal iff 
every perfect set P C M has a perfect subset Q C P which is disjoint 
from X. Note that no perfect set is in A set X C M is s-measurable 
iff for every perfect set P C M there is a perfect set Q C P such that 
either QfiX = 0orQCX. The set of s-measurable sets is called the 
Marczewski field. 

A well known theorem due to Alexandroff and Hausdorff says that 
every uncountable Borel set includes a perfect set (cf [Jec78, Theorem 
94]). It follows that every Borel set is s-measurable and that the Borel 
sets in s° are precisely the countable sets. Moreover, every s-measurable 
set X CM which is not in the ideal has a perfect subset. And perfect 
sets are clearly Borel. It follows that there is a dense embedding from the 
partial order Bor(M)\ctbl into the partial order of s-measurable sets that 
are not in s*^. This implies that the Boolean algebra Bor (M) /ctbl and the 
quotient of the Boolean algebra of s-measurable sets modulo the ideal s° 
have isomorphic completions and are thus forcing equivalent. 

We give a combinatorial description of Sacks forcing. First note that 
every perfect set X CM has a perfect subset which is homeomorphic to 
2‘^.^ It follows that the copies of 2“ are dense in Bor(M)\ctbl. Therefore 



Note that all the time we considered I as the actual set of ground model Borel sets, i.e., as an 
ideal in (Bor(R))'^. Usually T fails to be an ideal in Bor(R) in V (G). IfJ is nicely definable 
like A4 and X, then typically the ground model ideal J is different from the ideal in V[G] 
which is obtained by applying the definition of X in U[G]. 

^ Cf. [Brei95] for some information on Marczewski’s ideal and other similarly defined ideals. 

® Cf. [Jec78, Lemma 4.2] for this. The statement follows from the usual proof of the fact that 
every perfect set is of size . 
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forcing with the partial order of perfeet subsets of 2“^ gives the same 
generie extensions as foreing with Bor(M)/ctbl. 

Let X be a subset of 2“^. Then the set T{X) \= {x\n \ n & uj /\ x & 
X} of initial segments of elements of X is a subtree of the tree (2^^^, C). 
For a subtree T of 2<^^ let [T] := {t G 2^^ : Vn G ui{x fn G T)} be the set 
of branehes through T. For every subtree T of 2“^“^ the set [T] is elosed. 
For every elosed set X C 2“^, [T(X)] = X. In this translation between 
elosed subsets of 2“^ and subtrees of 2“^“^ the perfeet sets eorrespond to 
perfect trees. A nonempty subtree T of 2"^“^ is perfeet iff for every s eT 
there are to, G T such that s C to, t\ and to and ti are ineomparable 
with respeet to C. 

This gives rise to our offieial definition of Saeks foreing §. 

Definition 2.1. Saeks foreing is the set § of all perfeet subtrees of 2^^^ 
ordered by inelusion. 

With this definition of Saeks foreing, whenever we refer to a Sacks 
real we mean the unique element x of Hpec \p\ where G is an §-generie 
filter. If T is a Saeks real and this is witnessed by G, then G is the set of 
all perfeet trees p in the ground model sueh that t G [p]. 

It is quite obvious that Saeks foreing does not have the e.e.e. In faet, 
there are antiehains of size 2^°: Fix an almost disjoint family {A^ | a < 
2^°} of subsets of u, and for each a < choose a perfeet tree 
whose splitting levels are exaetly the elements of Aa, for example, := 
{s G 2<“^ : Vn < |s|(n ^ Aa ^ s{n) = 0)}. If a f (3, then Ta fl 
ineludes no perfeet tree, so they are ineompatible. Nevertheless, S does 
not eollapse Ki either, sinee it satisfies Baumgartner’s Axiom A [Bau83, 
Seetion 7]: 

Definition 2.2. A foreing notion (P,<) satisfies Axiom A iff there is a 
deereasing ehain of partial orders < = <o 5 5 <2 • • • on P such that 

1. for every sequenee {pn)n<uj in P sueh that for all n e u, pn+i <n Pn 
there is p G P such that for all n e u, p <n Pn (such a sequence 
{Pn)n£uj is ealled a. fusion sequence and p is a fusion of (pn)nGcj) nnd 

2. if A C P is an antiehain and p G P, then for every n E u there 
is q <n P sueh that q is eompatible with at most eountably many 
members of A. 

Axiom A ean be seen as a generalisation of the e.e.e. sinee eaeh 
e.e.e. foreing satisfies Axiom A: Just let <„ be equality for all n > 0. 
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Axiom A forcings belong to a wider class of forcing notions preserving 
Ml, namely the class of proper forcings [She98, III.l]. Properness was 
introduced by Shelah as a strengthening of “not collapsing Ki” which 
behaves nicely with respect to iterations. Countable support iterations of 
proper forcings are again proper while countable support iterations of 
Ki-preserving forcing notions may collapse Ki [She98, III.3]. 

Instead of invoking properness it can be seen very quickly that Axiom 
A suffices to guarantee that Ki is preserved. 

Lemma 2.3. Let P be a forcing notion satisfying Axiom A. Then for 
every P-generic filter G over the ground model V the following holds: If 
a; is a countable set of ordinals in V[G], then there is a set y G C such 
that a; C y and y is countable in C. In particular, forcing with P does not 
collapse Ml. 

Proof. Let a; be a P-name and suppose that p G P forces that a; is a 
countable set of ordinals. We will construct a fusion sequence (gn)nstj 
starting with qo = p and a countable set y such that for every fusion q of 
this sequence, g Ih a; C p. We start by choosing a name h for a function 
from u into the ordinals such that p forces to be a function from u 
onto X. The existence of h is guaranteed by the Maximality Principle. 
Let n G cc and suppose we have already defined The set of conditions 
deciding h{n) is dense in P. It follows that there is a maximal antichain 

C P consisting of conditions that decide h{n). By condition (ii) of 
Axiom A there is g„+i <„ such that is compatible with at most 
countably many elements of An. 

Let g be a fusion of {qn)n&uj, and put 

y := {a : 3g' < g3n G u{q' Ih h{n) = a)}. 

Then, whenever G is P-generic over V and g G G, for every n G cc there 
is g' G G such that q' < q and q' Ih h{n) = a where a = hcin). (Recall 
that I^[G] has the same ordinals as V.) It follows that xg = hc[uj] C y. 

It remains to show that y is countable. Let n G cc and g' < g. Since An 
is a maximal antichain, there is r G such that q' and r are compatible. 
By the choice of An, there is such that r Ih h{n) = It follows 
that if q' decides h{n), then g' decides h{n) to be a^-. Since q' < g, 
r and g are compatible. By the choice of g„+i and since g < g^+i, g 
is compatible with at most countably many elements of An- It follows 
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that {a : 3q' < q{q' Ih h{n) = a)} is countable. But this implies the 
countability of y. q.e.d. 

The conclusion of Lemma 2.3 is useful enough to get a name. 

Definition 2.4. Let Vq and Vi be models of set theory such that Vq C Vi 
and Vo and Vi have the same ordinals. We say that Vi has the ^o-covering 
property over Vq iff for every set A G Vi of ordinals which is countable 
in Vi there is a set i? G Vq of ordinals which is countable in Vq such that 
AC B. 

Lemma 2.3 says that if P is a forcing notion satisfying Axiom A, 
then for every P- generic filter G over the ground model V, V[G] has the 
Ko-covering property over V. Propemess instead of Axiom A is actually 
sufficient for this (cf. [She98, III. 1.16]). 

The definition of Axiom A seems rather technical. However, it is 
a quite natural property which is typically satisfied by forcing notions 
where the conditions are trees which are required to split often. Let us 
show 

Lemma 2.5. Sacks forcing § satisfies Axiom A. 

Proof. For n C u and p G §, letp” consist of those t C p that are minimal 
in p (with respect to C) such that t has exactly n proper initial segments 
that have two immediate successors in p. For p, g G S let p <„ g iff p < g 
and p” = g”. Suppose (p„)nei.j is a sequence in S such that p„+i <„ p„ 
for all n C uj. Then g := f]Pn is easily seen to be a perfect tree, i.e., an 
element of §. Clearly, g <„ p„ for every n G cc. In the context of Sacks 
forcing we will refer to the intersection of a fusion sequence as the fusion 
of the sequence. 

Now let n G cc, p G §, and suppose that A is an antichain in §. 
Every a G 2” uniquely determines an element of p” (using the natural 
bijection between 2” and pA- Let p * a := {s C p : s C V to- C sj. 
Clearly, p * a C S and p* a <p. 

For every a G 2” let g^, < p * a be such that q^ is compatible with 
at most one element of A. Now g := lJae 2 ’" qa C S and g <„ p. Note 
that with this definition of g, for every u G 2*^ we have g^r = g * a. If g 
is compatible with some r C A, then there is a G 2*^ such that q * a is 
compatible with r. But every q*ais compatible with at most one element 
of A. It follows that g is compatible with at most 2"^ elements of A. q.e.d. 
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Since Sacks forcing satisfies Axiom A, it does not collapse Ki. Un- 
der CH, § is of size Ki and therefore does not collapse any cardinal above 
Ki, either. However, if CH fails in the ground model, forcing with § could 
easily collapse 2^°. In fact, it is possible to have a ground model V such 
that CH fails in V and whenever G is §-generic over V, 2^° = Ki in V[G]. 
We will give the argument for this in Section 7. Petr Simon [Simi93] 
showed that adding a Sacks real collapses to the unboundedness num- 
ber b, the least size of a set U C such that for all / : cu — cu there is 
g E U such that for infinitely many n E to, g{n) > f{n). But b < 2^0 is 
not the only reason for Sacks forcing to collapse cardinals. 

Judah, Miller, and Shelah in [JudMiloShe92] showed that it is con- 
sistent with Martin’s Axiom (which implies b = 2^°) that Sacks forcing 
collapses cardinals. On the other hand, Shelah showed that it is consis- 
tent with ZFC -f -iCH that Sacks forcing does not collapse cardinals (cf. 
[CaroLav89]). Moreover, Carlson and Laver [CaroLav89] showed that 
some strengthened form of Martin’s Axiom implies that Sacks forcing 
does not collapse 2^° to Ki. This stronger version of Martin’s axiom is 
known to be consistent with ZFC -I- 2^°=K2. This also implies that it 
is consistent with ZFC -I- -iCH that Sacks forcing does not collapse any 
cardinal. 

Let us go back to the proof of Lemma 2.5. This proof shows that 
Sacks forcing § actually satisfies a stronger form of Axiom A: Given a 
condition p, an integer n, and an antichain A, we can find q <n P which is 
compatible with only finitely many members of A. This already implies 
that Sacks forcing has the so-called Sacks property. However, we can do 
even better. 

Definition 2.6. Let f : u ^ C : cu — > is an f-cone iff 

for all n E uj, \G{n)\ < f{n). A real r E is covered by G iff for all 
n E uj, r{n) E G{n). Let Uq and Vi be models of set theory such that 
Vo U Vi- We say that Vi has the Sacks property over Vq iff every real 
r : cu — cu in Ui is covered by a 2”-cone G : u ^ in Y>- (Here 2” 
is meant as a shortcut for the function n h- > 2”.) A forcing notion P has 
the Sacks property iff for every P-generic filter G over the ground model 
V, V[G] has the Sacks property over V. 

A subtree T of is binary iff every t eT has at most 2 immediate 
successors in T. A real r : cu — cu is covered by T iff r G [T]. Let Vq and 
Vi be as before. Then Vi has the 2-localization property over Vq iff every 
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real r G in Vi is covered by a binary tree in Vq. A forcing notion P 
has the 2-localization property iff for every P-generic filter G over the 
ground model V, V[G] has the 2-localization property over V. 

The Sacks property clearly follows from the 2-localization property. 

Lemma 2.7. Sacks forcing has the 2-localization property. In particular, 
it has the Sacks property. 

Proof. Let p G §, and suppose that i is an §-name such that p Ih i G 
It suffices to find a binary tree T and q < p such that q forces i to be a 
branch through T. The condition q will be the fusion of a fusion sequence 
{Pn)n&uj- We then define T to be 

Tq{z) := {s G < q{q Ih s C i)}, 

the tree of q-possibilities for z. Note that q forces i to be a branch of 
Tg{z). While choosing the sequence (pn)neoj all we have to make sure is 
that Tq{z) becomes binary. 

The set 

{g G § : g decides all of i} 

U{gG§:Vr<g(r does not decide all of z)} 

is dense in §. If there is some q < p which decides all of i we are done 
since in this case Tq{z) does not have incomparable elements. Thus, we 
may assume that no condition below p decides all of i. 

For every condition g < p let Zq be the longest initial segment of 
i that is decided by g. In particular, g Ih i. Since no condition 

below p decides all of i, Zq G Let po := p. Let n G a; and suppose 
that we have already chosen p„. For a G 2” consider the conditions 
Pn * (o'"'0) and pn * where s^i denotes the concatenation of s 

with the sequence of length 1 that has the value i. Since p„ * (<t"' 0) and 
Pn * do not decide all of i, there are q^^Q < Pn * and 

qa^i < Pn*(<r"'l) such that Zq^^^ and Zq^^^ are incomparable with respect 
to c. Now Pn-l-l • Ujg 2 ,o-e 2 ’* —n Pn- 

Let g := Hneo. Pn- It is easily checked by induction on n that for all 
n E oj the finite tree T„ which is generated by (z.e., consists of all initial 
segments of elements of) {zq^„ : a G 2”} has the following properties: 
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1 . is a finite binary tree of height at least n, 

2. C T = Tq{z), and 

3. if t G T is of length < n, then t G T„. 

It follows that T = IJnsoj and that T is binary. q.e.d. 

Let us analyze this proof. For every p G S and every name i for an 
element of such that no condition below p decides all of i we found a 
condition q < p such that Tq{z) is binary. It follows from the construction 
of q that 

h:[q]^ [Tg{z)]]x ^ [J{zq^cT : (t e A x e [q * a]} 

is well-defined and in fact a homeomorphism. Obviously, q forces the 
Sacks real to be a branch of q (every condition in § does this). Moreover, 
q forces that h maps the Sacks real to i. 

This shows the following: Let G be §-generic over the ground model 
V. Then for every element 2 : of in V[G] with z there is a home- 
omorphism h E V between a perfect subset of 2^^ (the set [g] for some 
suitable q e G) and a perfect subset of (the set [Tq{z)] for some 
name i for z) such that (in 1^[G]) h maps the Sacks real x added by 
G to 2 :. The homeomorphism h induces an order isomorphism between 
the set of perfect subtrees of q and the set of perfect subtrees of Tg{z). 
The partial orders of perfect subtrees of q and of Tg(f) are clearly iso- 
morphic to §. It is easily checked that {p < q : z E [p] } is a generic 
filter for the partial order of perfect subtrees of q. But this implies that 
{p E S ■. p C Tq{z) A z E [p]} is a generic filter for the partial order of 
perfect subtrees of Tq{z). From this fact it easily follows that if 2 : hap- 
pens to be an element of 2“^, then z is a Sacks real belonging to some 
§-generic filter which is usually different from, but interdefinable with G 
using ground model parameters. It follows that every new element of 2^^ 
is again a Sacks real. (This was proved by Sacks in [SacVl].) 

The existence of a ground model homeomorphism h mapping the 
original Sacks real x to z implies that x can be reconstructed from using 
a ground model parameter, namely h (or rather h~^). This observation 
already indicates the minimality of the Sacks extension mentioned in the 
introduction: For every §-generic filter G over the ground model V and 
every model W of set theory such that V W ^V[G], either W = V 
or V[G] = W. A real r which codes (i.e., is interdefinable with) some 
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P-generic filter G over V for some partial order P is called a minimal 
real iff V[G] is a minimal extension of V. 

From [Jec78, Lemma 25.2 and Lemma 25.3] we quote the following: 
If G is a P-generic filter over the ground model V for some forcing notion 
P and IL is a model of ZFC such that V C IL ^ V[G], then there is a 
set A of ordinals such that IV = 1^[A] where V[A] is the smallest model 
of ZFC which extends V and contains A. Note that it is not obvious that 
V[A] exists at all for a given set A e V[G] of ordinals. However, it can 
be shown that for every set A eV[G] of ordinals there is a forcing notion 
Q G L and a complete embedding e : Q — P such that l^[e“^[G]] is the 
smallest model of ZFC which extends V and contains A. 

A mapping e : Q — P is a complete embedding iff for all g, q' G Q, 

- q <q' implies e{q) < e(g'), and 

- e(g) _L e(g') iff q ± q' , 

and 



- for all p G P there is g G Q such that for all g' G Q with g' < g, e(g') 
is compatible with p. 

If e : Q — P is a complete embedding (in the ground model V), then for 
every P-generic filter G over V, e~^[G] is Q-generic over V. 

Lemma 2.8. Sacks reals are minimal. 

Proof. Let A be a name for a set of ordinals. Then there is an ordinal 
a such that 1§ forces A to be a subset of a. Let i be a name for the 
characteristic function of A from a to 2. Let p G § and let r be a name 
for the Sacks real added by the §-generic filter. We have to find g < p 
such that g decides all of i or such that for every §-generic filter G over 
the ground model V with q E G,tg can be reconstructed from zq using 
parameters in V. 

We proceed essentially as in the proof of Lemma 2.7. Again we may 
assume that no condition below p decides all of i since otherwise we 
are done. For every condition g < p let Zg be the longest initial segment 
of i that is decided by g, as before. Since no condition below p decides 
all of i, the domain of Zg is an ordinal < a. Let po := p. Let n G cu 
and suppose that we have already chosen p„. For a G 2” consider the 
conditions p„ * (<t"" 0) and p„ * (<t'" 1). Since p„ * (<t'" 0) and p„ * 
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do not decide all of i, there are <Pn* (<r"'0) and < Pn* 
such that Zq and Zq are incomparable with respect to C. Letp„+i : = 

Uje2,(re2" Qcr'^i ~n Pn- 

Let q ;= flnso; Pn- Now if G is S-generic over V and q G G, then re 
is the unique element of [q] such that for all a G 2^“^ with f-Q G [q * a], 
Zq*a C zg- This shows that q e G implies re G V^[24g] and we are done. 

q.e.d. 

Brendle in [BrciOOb] proved some interesting theorems about the set 
of all Sacks reals over the ground model 1^ in a generic extension V[G]. 
He showed that after adding a single Sacks real over V, the set of ground 
model reals is in the ideal Since every new real is a Sacks real, this 
means that the set of Sacks reals is big with respect to i.e., its com- 
plement is in It follows that after adding a Sacks real xq over V and 
then adding a Sacks real xi over V^[a;o], in V[xq] [xi] the set of Sacks reals 
over V is in 

The results about Sacks forcing mentioned so far show that when 
adding a single Sacks real, we have a very good control about all the 
new reals. Let us mention another result along these lines. Let G be an 
ultrafilter on lo in the ground model V. Suppose G is P-generic over 
V for some forcing notion P. Then G is destroyed by adding G if the 
filter generated by U in V[G] is not an ultrafilter anymore. Otherwise the 
ultrafilter is preserved. It can be shown that adding any new real destroys 
some ultrafilter from the ground model {cf. [Bar2lud95, Theorem 6.2.2]). 
However, some ground model ultrafilters are preserved when adding a 
Sacks real. 

An ultrafilter W on ci; is a p-point if for every countable family hi 
there is a set 17 G 7/ which is almost included in all elements of T . The set 
U is called a pseudo-intersection ofiF. In [Bar2lud95, Theorem 7.3.48] it 
was shown that Miller forcing (see Section 4 for the definition of Miller 
forcing) preserves p-points. The same proof works for Sacks forcing. 

Lemma 2.9. If U is a p-point in the ground model V and G is §-generic 
over V, then U generates a p-point in V [G] . In other words. Sacks forcing 
preserves p-points. 

It should be pointed out that it is very easy to destroy every ground 
model ultrafilter by adding a new real. A real r G 2^^ is called a splitting 
real over the ground model V if for A := r“^(l) and all B G p(co) fl V 
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that are neither finite nor cofinite, A B and B\A are both infinite. 
Both random reals and Cohen reals are splitting. It is easily checked that 
adding a splitting real destroys all ultrafilters from the ground model. 

To conclude this section, we mention a result about the II 3 -theory of 
L[t] where t is a Sacks real over Godel’s universe L of constructible sets. 
Note that the minimality of the Sacks extension implies that by adding a 
Sacks real t to L, one obtains a minimal model of V 7 ^ L, i.e., for every 
class C C L[t] that is a model of ZFC, either C = L[t] or C = L, and 
in the latter case C |= V = L. 

Recall that ^{x) is a S^-formula if is of the form 

3ri e Vr2 G i/’(T,ri,r2) 

where ip is arithmetical, i.e., only has quantifiers ranging over natural 
numbers (see, e.g., [Jec78, 7.40]). Shoenfield’s Absoluteness Theorem 
{cf. [Jec78, Theorem 98]) implies that S^-formulas are absolute for forc- 
ing extensions. That is, if V [G] is a forcing extension of the ground model 
V, (f(x) is a S 2 -formula, and r is an element of in V, then (f(r) holds 
in V iff it holds in V[G]. 

A Ilg-statement is a statement of the form 

Vro G 3ri G 0 ;“^ Vr2 G Ti, T2) 

where ip is arithmetical. 

Lemma 2.10. If 99 is a Ilg-statement which holds in a universe contain- 
ing a non-constructible real, then it holds in L [G] where G is §-generic 
over L. 

It follows that if there is any model V which is different from L such 
that in V every real r satisfies the S^-formula 

3ri G Vr2 G 'pi^r, ri, T2), 

then Sacks forcing over L adds no real for which this statement fails. 

As Woodin remarked in [Woo99, Theorem 1.5], Lemma 2.10 is the 
consequence of the following result of Mansfield (cf. [Jec78, Theorem 
99]): 

Theorem 2.11. Let A be a set of reals defined by a S^-formula with 
constructible parameters. If A contains a non-constructible element, then 
it includes a constructibly coded perfect subset. 
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To see why this implies Lemma 2.10, let L be a model of ZFC eon- 
taining a non-eonstruetible real, let ip{x) be a S^-formula with only eon- 
struetible parameters, and suppose that V |= Wr(p{r). Fix a generie exten- 
sion V[G] in whieh K]" is eountable. Then in partieular, all eonstruetible 
dense sets of § are eountable in V[G]. We have already seen that in the 
Saeks extension every new real is itself a Saeks real. Therefore it suf- 
fiees to show that the Saeks generie real satisfies (p in L[G] where G is 
§-generie over L. 

In order to show this, let T G § fl L. In V[G], A = {r E [T] \ (f{r)} 
is a S 2 -set eontaining a non-eonstruetible element (from V), and thus 
Theorem 2.11 yields a perfeet tree T' G L sueh that [T'] C A. By the 
ehoiee of V[G], [T'] also eontains a Saeks real over L, and sinee S^- 
statements are absolute, neither T' nor any perfeet subtree of it ean foree 

As Brendle and Lowe observed in [BreiLdw99], a slight generaliza- 
tion of Theorem 2. 1 1 ean be used to show the equivalenee of the state- 
ments “every S^-set of reals is s-measurable” and “every A^-set of reals 
is s-measurable”. Here a set of reals is S 2 iff it is the set of reals satis- 
fying a fixed S^-formula with a fixed additional real parameter. A set of 
reals is A 2 iff the set and its eomplement are S 2 . The elass of S 2 -sets 
ean also be defined topologieally (cf. [Jee78, p. 500]). The S{-sets are 
the analytic sets, i.e., the projeetions of Borel sets, the IlJ-sets are the 
eomplements of Sj-sets, and the S^-sets are the projeetions of Ilj-sets. 

3 Consequences of the Sacks property 

Before diseussing eonsequenees of the Saeks property, let us observe that 
in the definition of the Saeks property it is not essential that the size of 
G{n) is bounded by 2”. The number 2” eould be replaeed by any other 
non-deereasing unbounded funetion from oj to u. This ean be seen as 
follows: 

Let Vo and Vi be models of set theory with Vq C Vi. Let f,g:u^ 
o;\{0} be non-deereasing and unbounded funetions in Vq. Suppose that 
every real in Vi is eovered by an /-eone in Vq. In other words, suppose 
that Vi has the Saeks property over Vq with 2” replaeed by /. We show 
that every new real in Vi is eovered by a g-cone from Vq. Note that we 
may assume /(O) < 5f(0) sinee every /-eone is the eoordinatewise union 
of finitely many /-eones G with |C(0)| = 1. 




ON SACKS FORCING AND THE SACKS PROPERTY 



115 



Let r : a; — > a; be a real in Vi. In Vq choose a strictly increasing se- 
quence of natural numbers such that for every n e u, g{nin) > 

f{n). By our assumption /(O) < (7(0), we may assume mo = 0 . Identi- 
fying oj with for a moment, we find an /-cone C \ u ^ in 

Vo which covers {r\mn)n&uj- Let D \ uj ^ be defined as follows: 

For every n G c<; let D{n) := {s(n) : s E C{n) An E dom(s)}. Now for 
every n E u, if m is in the interval [m„, m„+i), then \D{m)\ < f{n) < 
g{fnn) < g{fn). It follows that D is a (7-cone in Vq covering r. 

The Sacks property implies that the new reals in the generic extension 
are very well controlled by ground model reals. For example, if Vi has 
the Sacks property over Vo, then the 2^° of Vo is still uncountable in Vi. 
This is because in Vi, countably many 2"^-cones do not suffice to cover 
all of but the 2"^-cones from Vq do cover all of 

We mention two more examples of this: If a forcing notion P has the 
Sacks property and G is P-generic over the ground model V, then the 
ideal of measure zero sets in the generic extension V[G] is gener- 
ated by the measure zero Borel sets from the ground modeV. Similarly, 
the ideal of nowhere dense subsets of 2“^ in V[G] is generated by the 
nowhere dense Borel sets in V. Both of these consequences of the Sacks 
property follow from the following lemma. 

Lemma 3.1. Let Vq and Vi be models of set theory such that Vq C Vi 
and Vo and Vi have the same ordinals. Suppose that Vi has the Sacks 
property over Vq. Then 

a) for every measure zero set A C M in Vi there is a Borel set 5 C M 
in Vo of measure zero such that A C B holds in Vi and 

b) for every nowhere dense set A C 2 “^ in Vi there is a nowhere dense 
Borel set B <E 2 ‘^ in Vq such that A C B holds in Vi. 

Proof, a) Let A denote the Lebesgue measure on M. Recall that if A C M 
is of measure zero, then for every sequence {en)n&uj of positive real num- 
bers there is a sequence {Un)n£uj of finite unions of open intervals with 
rational endpoints such that A C Un and for all n E u, A(f/„) < 

We do some preparations in Vq. Let (On)neoj be an enumeration of all 
finite unions of open intervals with rational endpoints, let e : ci; x ci; — cu 

^ Whether a Borel code codes a measure zero set is absolute, that is, it does not depend on the 
model of set theory in which we evaluate the Borel code. The analogue is true for Borel codes 
of nowhere dense sets and of meager sets. Cf. [Jec78, Lemma 42.4]. 
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be a bijection, and fix a matrix of positive real numbers such 

that for alH G a;, 



E 



Sjj 

D 2* 



j&UJ 



Now for every i & ui there is /* G in Vi such that A C O/di) 
and for every j G a;, 



• MO/.O)) 



< e 



ZJ- 



Consider the real r : u ^ oo defined by r(n) := m iff m = fi{j) 
and n = e(i, j). By the Sacks property of P there is a 2”-cone C : cu — > 
[ 0 ;]^^° in Vo that covers r. By the definition of r, A is a subset of 



^*:=nUU{0™^ m G C{e{i,j)) A ■ A(O^) < Eij}. 

iGu) JGlo 



Since only ground model parameters are used in the definition of B, 
B is a Borel set coded in the ground model. For all i,j G u, the set 
C{e{i,j)) has at most elements. It follows that the measure of 
: m G C{e{i,j)) A ■ \{Om) < £ij} is not greater than 
Eij. Therefore, and by the choice of (Eij)ij^^, for every i e 00 , 

a( U 1 J{ 0 ™ : m e C(e(i,j)) A . A( 0 „.) < £y}) < i 



It follows that B is of measure zero. 

b) Note that if A C 2^^ is nowhere dense, then for all n G a; there is 
m > n and t : [n, m) — > 2 such that {x G 2‘^ : f C x} is disjoint from A. 
(Here [n, m) denotes the interval {n, . . . , m — 1} in oj.) This can be seen 
as follows: Let n E u and let (sj)j<2" be an enumeration of 2". Since 
A is nowhere dense, there is a sequence (fi)i<2" in 2<“^ such that for all 
i <2‘^ the set {x E 2‘^ : s^tQ .. A ti C x} is disjoint from A (recall that 
sA is the concatenation of the two finite sequences s and t). Now it is 
easy to check that t \= .. A 12 ^^ has the desired property. 

Using this observation we can choose a strictly increasing sequence 
(iTi)iea; of natural numbers and a sequence (ti)nea; such that for all i E uj, 
ti E and {a; G 2“^ : fj C x} is disjoint from A. In other words, 

letting 2 : := ... we have the following: For alH G a; the set {x E 

2“ : x\[ni,ni+i) = z\[ni,ni+i)} is disjoint from A. This property of 2 : 
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does not change if we replace the sequence by another strictly 

increasing sequence provided for every i e u there is j Go; 

such that [rij, Uj+i) C [rrii, rrii+i). By the Sacks property, there is a 2”- 
cone C \ uj ^ in K) covering i i— > n*. Putting mo := max(C(0)) 
and rrii+i := max(C(mj + 1)) for every i e u, we obtain sequence in Vq 
such that for every i G c<;theset{a; G 2‘^ : t |'[mj, mj+i) = 2 : |'[mj, mj+i)} 
is disjoint from A. 

Now let {ji)i^uj be a sequence in Vq of natural numbers such that for 
all 7 G a;, jj+i — ji > 2\ By the Sacks property, there is a 2”-cone 
C : u ^ covering i 1 — > z\mj^^^. We may assume that for 

every i ^ u every t e C{i) has dom(t) = Since ji+i — ji > 2* and 

|C(i)| < 2\ there is Ui : 2 such that for every t e C{i) 

there is k e [ji, ji+i) such that t agrees with ui on [mk,mk+i). Since 
G C{i), no X e A extends Ui. The sequence {ui)i(z^ can be 
chosen in Vq. Now the set {t G 2“^ : G u{ui C x)} is coded in Vq and 

disjoint from AinVi. Its complement B is a nowhere dense Borel set in 
the ground model which covers A in Vi. q.e.d. 

The last lemma implies that forcing notions with the Sacks property 
neither add Cohen reals nor random reals. Moreover, we can compute 
the cofinality of A/”, the least size of a family JF C A/” such that for all 
A G A/” there is 5 G JF with A C 5, in forcing extensions with the Sacks 
property over the ground model. 

Corollary 3.2. Let Vo and Vj be models of set theory such that Vo C Vj, 
Vq and Vj have the same ordinals, and Vj has the Sacks property over Vj. 
Then in Vj the cofinality of A/” (ordered by C) is at most |(2^°)'^° | where 
(2^°)^° denotes the ordinal which is the size of 2‘^ in Vj. In particular, if 
Vj satisfies CH, then the cofinality of A/” is Ki in Vj. 

Proof. By Lemma 3.1, the Borel measure zero sets coded in Vj are co- 
final in J\f^^ . In Vj there is a bijection between 2^° and all Borel subsets 
of 2“^. Since (2^°)^° is an ordinal, and since Vj and Vj have the same 
ordinals, in Vj there is a bijection between |(2^°)'^°| and all Borel sets in 
2‘^ which are coded in Vj. Therefore, in Vj the cofinality of A/” is at most 
|( 2 ^ojVb|_ Now suppose that Vj satisfies CH. Then Vj |= |(2^°)'^°| < Ki. 
But the cofinality of A/” is uncountable and therefore it must be Ki in Vj. 

q.e.d. 
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The cofinality of M is the largest cardinal that appears in Cichon’s 
diagram {cf. [Bar2Jud95, Chapter 2]). Thus, Corollary 3.2 implies that if 
one starts from a model of CH and then adds a P-generic filter over the 
ground model for some forcing notion P which has the Sacks property, 
then one ends up with a model where all the cardinals in Cichon’s dia- 
gram are Ml. Of course this is only interesting if there is a forcing notion 
with the Sacks property that increases 2^°. We will go back to this when 
we discuss iterated forcing. The cardinals in Cichon’s diagram, as well 
as other cardinal characteristics of the continuum, are studied in depth in 
[BaT2Jud95]. In this book the values of many cardinal characteristics of 
the continuum are computed in various models of set theory. Corollary 
3.2 and further characterizations of the Sacks property for proper forcing 
notions can be found there. For a nice overview of cardinal characteristics 
of the continuum see [Blaicx)]. 

4 Variants of Sacks forcing 

We give a brief overview of some relatives of Sacks forcing. Except for 
forcing with finitely branching trees, all the forcing notions we mention 
in this section are discussed in [Bar2Jud95]. 

All the variants of Sacks forcing in this section satisfy Axiom A and 
have the Laver property, a weakening of the Sacks property. A forcing 
notion P has the Laver property iff for every P-generic filter G over the 
ground model V and every r : oj ^ u mV[G] the following holds: If r 
is bounded by a ground model real, i.e., if there ish : u ^ u mV such 
that for all but finitely many n G ci; we have r{n) < b{n) in V[G], then r 
is covered by a 2”-cone from the ground model. 

Since the Sacks property implies that every r : u; — > o; in the generic 
extension is bounded by some ground model real, the Sacks property is 
equivalent to the Laver property together with the property that every 
element of in the extension is bounded by a ground model real. A 
forcing notion with the latter property is called -bounding^ . Random 
real forcing and Cohen forcing do not have the Laver property, because 
the Laver property implies that any new real in the generic extension 
can be covered by a meager set of measure zero. In particular, forcing 
notions with the Laver property neither add random nor Cohen reals. Ex- 



For example, random real forcing is -bounding, Cohen forcing is not. 
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cept for Mathias forcing, all the variants of Sacks forcing in this section 
give minimal generic extensions like Sacks forcing does. 

We first summarize the properties of Sacks forcing §. Sacks forcing 
consists of all perfect subtrees of 2^‘^, ordered by inclusion. It has the 
Sacks property (even the 2-localization property) and preserves p-points. 
Moreover, if 99 is a Ilg-statement which holds in some model of set theory 
with a non-constructible real, then it holds in L [G] where G is S-generic 
over L. 

Forcing with finitely branching trees. There are some forcing no- 
tions which are very closely related to Sacks forcing in that the conditions 
are also finitely branching trees with certain perfectness properties. For 
example, for n G ci; with n > 2 a subtree p of n^‘-^ is n-perfect iff ev- 
ery s G p has an extension which has exactly n immediate successors in 
p. The forcing notion consisting of n-perfect trees ordered by inclusion 
has the n-localization property (which is obtained from the 2-localization 
property by replacing “binary tree” by “n-ary tree” where n-ary trees are 
those trees in which every node has at most n immediate successors) 
but not the (n — 1) -localization property [NewRos93]. Note that the n- 
localization property implies the Sacks property. 

Laver forcing L [Lav76] consists of all subtrees p of such that 
almost all s G p have infinitely many immediate successors in p. The set 
L is ordered by inclusion. Laver forcing adds a dominating real, i.e., a 
function d : u ^ u which bounds all ground model functions from oj 
to u. This implies that Laver forcing destroys every ultrafilter from the 
ground model. If G is L-generic over the ground model V, then in V[G] 
the set M n y has outer measure one. 

Modifying Mansfield’s proof of Theorem 2.11, one can show that if 
<p is a Ilg-statement which holds in a universe with a dominating real 
over L, then <p holds in L[G] where G is L-generic over L. For this no- 
tice that if there is a dominating real over L, then there is a so-called 
strongly dominating real over L (cf. [Go1i-f 95]). Moreover, if A is a E^- 
set in constructible parameters which contains a strongly dominating real 
over L, then A contains all the branches of a constructibly coded Laver 
condition. This can be shown using ideas from [Go1i-f95]. 

Miller forcing or rational perfect set forcing MP [Milo 84] consists 
of all subtrees p of such that every s G p has an extension t E p 
which has infinitely many immediate successors in p. The elements of 
MP are sometimes called superperfect trees. Miller forcing is ordered by 
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inclusion. It preserves p-points and therefore does not add a dominating 
real. If G is MP-generie over the ground model V, then M fl is neither 
meager nor of measure zero in V[G]. 

As for Laver foreing, a variant of Theorem 2. 1 1 is true for Miller fore- 
ing. A real r \ oj ^ u is unbounded over a elass G iff it is not bounded 
by a funetion in ft G. Using ideas from [KeeVV] it is possible to show 
that every eonstruetibly eoded S^-set whieh eontains an unbounded real 
over L eontains all the branehes of a eonstruetibly eoded Miller eondi- 
tion. This implies that every Ilg-statement whieh holds in a model of set 
theory with an unbounded real over L also holds in every model of the 
form L[G] where G is MP-generie over L. 

Mathias forcing M [MatVV] eonsists of all pairs (s, A) sueh that s C 
oj is finite and A C a;\ IJ s is infinite. The set M is ordered as follows: 
(s, A) < {t, B) iff t C s, A C B and s\t C B. Mathias foreing adds a 
dominating real. If G is M-generie over the ground model V, then M fl U 
is a meager set of measure zero in V[G]. 

Of eourse, we ean only mention the most popular relatives of Saeks 
foreing. More foreing notions whieh use trees as eonditions, usually 
ealled tree foreings, are studied in [Brei95]. A general framework for 
eonstrueting tree foreings (ereature foreing with tree creatures) was de- 
veloped by Roslanowski and Shelah in [RosShe99]. 

Some foreing notions ean be eonsidered as suborders of Saeks for- 
eing, but have not been mentioned in this seetion sinee they seem to be 
based on a different philosophy. Cohen foreing is an example of this: 
Every Cohen eondition p, a finite partial funetion from u to 2, determines 
a perfeet tree 



Tp := {g G 2'^‘^ : p U g is a funetion}. 

The mapping p i— > Tp is an embedding of Cohen foreing into Saeks fore- 
ing, but this embedding is obviously not a eomplete embedding. In the 
same way. Silver foreing and Grigorieff foreing ean be eonsidered as sub- 
orders of Saeks foreing.^ However, Silver foreing and Grigorieff foreing 
are mueh eloser to Saeks foreing than Cohen foreing is sinee they use as 
their eonditions partial funetions from cc to 2 whose domains may be infi- 
nite. They are both proper. Grigorieff foreing is -bounding, but Silver 

^ Cf. [She98, VI.4] for the definitions of these two forcing notions. Shelah used Grigorieff forc- 
ing to construct a model of set theory without p-points. 
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forcing is not. Note that the Silver forcing in [She98, VI.4] is different 
from the similar Pffkxy-Silver forcing as defined in [Bau83, Section 7]. 
Pnkry-Silver forcing consists of all functions from co-infinite subsets of 
cc to 2 and is ordered by reverse inclusion. Pffkry-Silver forcing satisfies 
Axiom A. 

In the next section we mention some results about possible c.c.c. rel- 
atives of Sacks forcing. 

5 The Sacks property and the c.c.c. 

In the past many people investigated the question how close or how far 
away a c.c.c. forcing notion can be from Cohen forcing or random real 
forcing. Here “being close” simply means that Cohen forcing, respec- 
tively random real forcing can be completely embedded into the given 
forcing notion. If so, then such a forcing also adds a Cohen, respectively 
random real. One instance of this question, namely the question whether 
random real forcing completely embeds into every cc'^-bounding forcing 
notion that is c.c.c. goes back to the Scottish Book [Mau81] problem of 
von Neumann about measure algebras. 

Since the Sacks property implies that neither Cohen nor random reals 
are added, it is a good test question to ask when a c.c.c. forcing can 
have the Sacks property. Concerning reasonably definable partial orders, 
Shelah showed in [She94] that no c.c.c. Souslin forcing has the Laver 
property where a forcing notion is Souslin iff its underlying set, the order, 
and the incompatibility relation are analytic, i.e., Sj. So in particular, a 
c.c.c. Souslin forcing cannot have the Sacks property. Moreover, Shelah 
showed that if a c.c.c. Souslin forcing adds an unbounded real, then it 
adds a Cohen real. It is unknown whether there is a model of ZFC where 
every c.c.c. forcing adding an unbounded real adds a Cohen real. 

There are some combinatorical statements which imply or negate the 
existence of c.c.c. forcings (not necessarily nicely definable) with the 
Sacks property. We mention the combinatorial statements involved and 
give a summary of results. 

<0: There is a sequence {Aa)a<uji such that for all a < cci, Aq, C a 
and for every A C cci the set {a < c<;i | A fl a = Aq,} is stationary. 

In [JenVO], Jensen constructed a c.c.c. forcing with the Sacks prop- 
erty from <0. The principle <0, which implies CH, is used in an inductive 
construction of length oji to predict initial parts of antichains of the final 
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forcing notion already at a countable stage of the construction. During 
the construction one makes sure that the predicted antichains cannot be 
extended to uncountable antichains in the final forcing notion, so that the 
final forcing notion becomes c.c.c. This is similar to the standard con- 
struction of a Souslin tree from <) as presented in [KunSO, Theorem 7.8]. 

CCC(S): For every family V of size at most of dense subsets of 
Sacks forcing § there is a c.c.c. suborder P of § such that D fl P is dense 
in P for each D G V. 

This principle together with a large size of the continuum was shown 
to be consistent with ZFC by Velickovic [Vel91]. From CCC(§) one can 
easily extract a c.c.c. suborder of § which has the Sacks property. 

OCA (Open Coloring Axiom): Let X be a subset of the reals and 
suppose that Kq and K\ are disjoint subsets of [X]^ with [X]^ = XqUXi 
such that Kq is open in the topology that [X]^ inherits from the product 
topology on X^. Then either there is an uncountable X C § such that 
[X]^ C Kq, or § = IJ.g^ Xj where [Xj]^ C Ki for every f G cc. 

From this principle, which implies that the size of the continuum is at 
least K 2 , Velickovic [Vel02] deduced that no c.c.c. forcing adding a new 
real has the Sacks property. 

Principle (*) for n: Let X be a p-ideal on [k] where X C ( [«] -^ 0 ) is 
a p-ideal if for every countable set C C X there is X G X such that every 
member of C is almost included in X. Then either there is an uncountable 
Y C K such that [X]*^ C X, or k = Uisoj where [Xj]*^ ft X = 0 for 
every f G cu. 

The second author showed in [Qui02] that this principle also im- 
plies that no c.c.c. forcing has the Sacks property. Since (*) restricted 
to K = Ki is consistent with CH [AbrTod97], this also shows that CH 
is not strong enough to decide the existence of c.c.c. forcings with the 
Sacks property. 

The case of the Laver property is different: CH already implies the ex- 
istence of a c.c.c. forcing with the Laver property. In [SheOl] it is stated 
that Mathias forcing defined relatively to a Ramsey ultrafilter is an exam- 
ple (cf. [Mat77] for the definition of Mathias forcing relative to a Ramsey 
ultrafilter). A model where such forcings do not exist was found by She- 
lah [SheOl]. He showed that the countable support iteration of Mathias 
forcing of length cu 2 yields such a model. 

To conclude this section, we list some questions in this area. 
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1. Which combinatorical principles imply or deny the existence of a 
c.c.c. forcing preserving p-points? 

2. Are there forcing notions not preserving p-points, but not adding 
splitting reals? 

3. Which combinatorial principles imply or deny the existence of a c.c.c. 
forcing not adding splitting reals? 

The motivation for the second question is to look for less “brutal” 
methods to destroy ultrafilters than adding a splitting real. The other 
questions are variants of von Neumann’s problem. As mentioned before, 
both Cohen and Random real forcing add a splitting real and therefore 
destroy every ultrafilter in the ground model. 

6 Products of Sacks forcing 

If one wants to add many Sacks reals to a model of set theory, for exam- 
ple to increase 2^°, then there are essentially two possibilities: The first 
possibility is to use some kind of side-by-side product of many copies of 
§ to add many Sacks reals over ground model at once (in a parallel way). 
The second possibility is to iterate Sacks forcing, i.e., to add one Sacks 
real after the other. We will describe the second possibility in the next 
section. Forcing with side-by-side products has two advantages over the 
iteration: First, it is technically easier to deal with and second, it is pos- 
sible to construct models with an arbitrarily large size of the continuum. 
The iteration, at least the countable support iteration, which is discussed 
in the next section, only gives models where the size of the continuum 
is at most K 2 . However, the countable support iteration of Sacks forcing 
usually yields stronger consistency results than forcing with side-by-side 
products. 

Definition 6.1. The countable support (side-by-side) product P of a fam- 
ily (Qi)ie/ of forcing notions is the set of all p G Hie/ countable 

support supt(p) :={*€/: p{i) 7^ IqJ. The set P is ordered compo- 
nentwise, i.e., for p, g G P, p < g iff for all i G /, p{i) < q{i). 

Of course one can define products with finite supports or with un- 
countable supports. However, countable supports are what is appropriate 
for forcing notions satisfying Axiom A. Something that all reasonable 
products P of a family (Qi)ie/ (like the finite support product and the 
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countable support product) have in common, is the following: For every 
i G / the embedding e* : Qj — ^ P that maps every g G Qj to the sequence 
which has q as its i-th coordinate and is 1 everywhere else is a complete 
embedding. In particular, if G is P-generic over the ground model V, 
then for every i G /, e~^[G] is Qi-generic over V. In other words, if P is 
the countable support product of k copies of Sacks forcing, then P adds k 
Sacks reals. An easy density argument shows that the Sacks reals added 
by different factors of P are indeed different. 

Under CH, countable support products of copies of Sacks forcing do 
not collapse cardinals [Bau85]. The proof of this fact consists of two 
parts: First one shows that Ki is not collapsed by a countable support 
product of copies of Sacks forcing. Then one uses CH to show that in 
countable support products of forcing notions of size at most Ki all an- 
tichains are of size at most Mi. This can be done using a simple A-system 
argument. It follows that no cardinal above Mi is collapsed (cf. [KunSO, 
Lemma 6.9]). We only show that countable support products of copies 
of Sacks forcing do not collapse Mi. In the proof presented here we get 
the 2-localization property more or less for free. The following lemma is 
essentially taken from [NewRos93]. 

Lemma 6.2. Countable support products of copies of Sacks forcing have 
the 2-localization property. In particular, they have the Sacks property 
and do not collapse Mi. 

The proof of this lemma follows closely the proof of Lemma 2.7. But 
we have to deal with countably supported ^-sequences of conditions in 
§ this time. For this we need a lot of notation. Fix a cardinal k and let P 
be the countable support product of k copies of In order to show that 
P has the 2-localization property, we have to extend the notions “fusion” 
and “fusion sequence” to P. For this it is convenient to pass from our 
original definition of fusion sequences in § to a more flexible one. 

Now a sequence {pn)n&uj in S is a fusion sequence iff there is a non- 
decreasing unbounded function f : u ^ u such that for all n G a;, 
Pn+i ^f{n) Pn- It is casily checked that a sequence {pn)n&uj which is a 
fusion sequence according to the new definition has a subsequence which 
is a fusion sequence according to the old definition. If {pn)n&uj is a fusion 
sequence in according to the new definition, its fusion flnso; Pn i® again 
a condition in §, as before. 

The elements of P are functions from k to S or equivalently, K-sequences of elements of S. 
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Definition 6.3. For a finite set F C k and rj : F — a; let the relation 
on P be defined as follows: For all p,q e P let p g if p < q 
and for all a e F, p{a) <ri{a) 9(0) • For p, F, and p as before and a G 
riaeF let p * (T be such that for all a e F, {p* a) (a) = p(o;) * a{a) 
and for all a G k\F, (p * a) (a) = p{a). 

A sequence {pn)n&uj of conditions in P is a. fusion sequence if there 
is an increasing sequence of finite subsets of k and a sequence 

iVn)n&io such that for all n e u, pn ■ ^ u, for all i e F^ we have 

Vn{i) < Pn+i{i),Pn+i <F„,r^n Pn , Rud for all o G supt (p„ ) there is 771 G u 
such that a G Fm and Pm(o) > n. If (pn)necj is a fusion sequence in P, 
then for each a E k, (pn(o))neoj is a fusion sequence in § or constant 
with value 1§. This shows that 




is a condition in P, the fusion of the sequence (pn)neaj- 

For the proof of Lemma 6.2 it is enough to show 

Lemma 6.4. For every p G P and every name i for an element of 
there are a condition q < p and a binary tree T C such that q forces 
i to be a branch of T. 

Proof. Let p and i be as above. Our goal is to construct q < p such that 

Tq{z) = {s G : 3r < q{r Ih s C i)} 

is binary. We may assume that no condition below p decides all of i. 

Let F be a finite subset of k and let p : F — a; be a function. 
A condition r G P is (F, p) -faithful iff for all a,r E IlaeF and all 
F <Fri r the following holds: If 2^'*^ and are incomparable elements 

of (with respect to C), then already and are incomparable. 

Claim 1. Suppose r G P is (F, p)-faithful. 

a) Let j E k\F and define F' := F U {j} and p' := p U {(j, 0)}. Then 
r is (F', p') -faithful. 

b) Let j3 E F, and let p' be such that for all a E F\{/9}, p'{a) = p{a) 
and p'{(3) = p{(3) + 1. Then there is a condition r' r such that r' 
is (F, p') -faithful. 
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Part a) of this claim follows immediately from the definitions. For 
the proof of b) fix an enumeration {ai, . . . , of HieF We define 
a <F,r?-descending sequence {rk)k<m below r. For k G m} and 

/ e 2 let (T^ be such that for all a G F\{(3}, al{a) = ak{a) and (J^(/9) = 
ak{/3)^l. Let tq := r. Suppose we have constructed for some k < m 
and we want to build i. Let i <Fn' be such that i,.. ,,*^0 
and are incomparable if such a condition exists. Otherwise, let 

Tfc+i := Tfc. Finally, let r' := r^. It follows from the construction that r' 
is {F, ? 7 ')-faithful. This shows part b) of the claim. 

Using a) and b) of Claim 1 together with some bookkeeping, we con- 
struct a sequence (p^, jm)meuj such that 

L {pm)m&uj is a fusion sequence in P witnessed by (F^, rjm)m&oj where 
Fo = Vo = 0, 

2. for all m E uj,Pm is (F^, ? 7 m)-faithful and < P, 

3. for all m G cc and all i G F^, pm-i-i(*) < Vm{i) + 1, 

4. for all m G cc, jm is the unique element of F^+i such that either 

jm ^ ~ Pm{jm)F 1 OX jm ^ L’m ^^^Pm+l{jm) ~ Ij 

and 

5. for all m G cc and all a G Pm* o' decides at least i I'm. 

Let q be the fusion of the pm, m G cc, and let T := Tq{z). Note 
that for all m G oo, q <Fm,rim Pm- For each m G cc let Tm+i be the tree 
generated by 

ieFm 

and let Tq be the “tree” generated by Zp^. Now each Tm is a subtree of T. 
We show that T = Tm- 

Let t G T and m := dom(t). Let r < g be a condition which forces 
that t is an initial segment of i. The set {g * a : u G IljeF™ 2’'’"^*^} is 
a maximal antichain below g. It follows that there is a G IlieF™ 2’'’"^*^ 
such that r is compatible with q*a. Since q*a < Pm* o', r is compatible 
with Pm * O'. This implies t C and thus t G Tm- 

It remains to prove that T is binary. We first show that for every m G 
oj and t G Tm, if there is s G T such that s and t are incomparable, then 
there is s' G Tm such that s' F s and already s' and t are incomparable. 
Let s, t, and m be as before. Let r < g be a condition which forces s to 
be an initial segment of i. Then there is a G HieF™ 2’'’"^*^ such that r is 
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compatible with prn * o'- Let r G n be such that t C 

Clearly, a ^ r. Let m' <m minimal such that 

a' ■= (a(o) \Tlm'{a))a&F^, 

and 

t' ■= (r(o) \Tlm'{a))aaF^, 

are different. Now s' ;= and are incomparable by the 

?7m') ‘faithfulness of Pm'- Clearly, s' G T^, and s' and t are incom- 
parable. 

Now suppose that T is not binary. Then there is t G T such that t 
has three distinct immediate successors to, t\, and t2 in T. Let m G cc 
be minimal such that contains at least two of the tj, j < 3. By what 
we proved just before, in fact contains all the tj. For every j < 3 let 
aj G riisF™ be such that tj C For every n < m and every 

j < 3 let a'j := {<Jj{a) \pnia))a&F„- Now let n < m be minimal such 
that the set {o’j' j 3} has at least two elements. Then {o’j' i j <3} 
has exactly 2 elements and n < m by the construction of F„ and pn- 
Without loss of generality we may assume that Uq and a” are different. 
Since m is minimal such that contains at least two of the tj and since 
n < m, Zp^*a^ and Zp^*„^ are comparable. However, this contradicts the 
(F„, ?7„) -faithfulness of pn- q-e.d. 

Corollary 6.5. Let C be a model of set theory satisfying CH and suppose 
that K is a cardinal V. Then there is a generic extension V[G] of V such 
that V[G] 1= 2^° > K, V[G] has the same cardinals as V, and V[G] 
has the 2-localization property over V. In particular, it is consistent with 
an arbitrarily large size of the continuum that all cardinals in Cichoh’s 
diagram are Ki. 

Proof. In V let P be the countable support product of k copies of Sacks 
forcing. By CH, P has no antichains of size K 2 and therefore does not col- 
lapse any cardinal above Ki, as we mentioned before. By Lemma 6.2, P 
does not collapse Ki. Let G be P-generic over V. Then V[G] and V have 
the same cardinals and V[G] has the Sacks property over V by Lemma 
6.2. Now it follows from Corollary 3.2 that all the cardinals in Cichoh’s 
diagram are Ki. It remains to evaluate the size of the continuum in V[G]. 
As mentioned before, it is easily checked that the Sacks reals added by 
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the different faetors of P are pairwise different. It follows that is at 
least K in y[G]. q.e.d. 

In order to determine the exaet value of 2^° in V[G] in this proof, one 
has to analyze the eonstruetion of names for subsets of u. For simplieity, 
let us assume that V satisfies the Generalized Continuum Hypothesis 
(GCH) and suppose that k is of uneountable eofinality in V. Now it ean 
be shown that in V there is a set N of size k sueh that every subset of lo 
in V[G] has a name in N . This implies that in V[G], 2^° is exaetly k. 

Countable support produets of Saeks foreing have been used by 
Baumgartner {cf. [Bau85]) to show the eonsisteney of the total failure 
of Martin’s Axiom: The topologieal version of Martin’s Axiom ean be 
stated as follows: No eompaet spaee without uneountable disjoint fami- 
lies of open sets is the union of < 2^° nowhere dense sets. The Baire Cat- 
egory Theorem implies that no eompaet spaee is the union of eountably 
many nowhere dense sets. Martin’s Axiom /a/A totally if every eompaet 
spaee without uneountable disjoint families of open sets is the union of 

nowhere dense sets, but 2^° > Ki. Baumgartner’s artiele ineludes a 
niee introduetion to eountable support produets of Saeks foreing. 



7 Iteration of Sacks forcing 



Side-by-side produets of partial orders adding reals, sueh as the eount- 
able support produets of Saeks foreing introdueed in Seetion 6, have one 
major disadvantage when it eomes to strong independenee results: The 
reals added by the different faetors of the produet are independent over 
the ground model. That is, if P is the eountable support produet of k 
eopies of Saeks foreing, G is P-generie over the the ground model V, 
and for eaeh a < k, Xa denotes the Saeks real added by the a-th faetor 
of P, then for every set A G 1^ with A C nwe have 



^2‘^'^V[{Xa)aSA] p (^2^y^[(Xa)oc(iK\A\ — 



V 



Reeall that offieially we only defined V[X] if X is a set of ordinals. But 
it should be elear how to eode {xa)a&A and {xa)a&K\A by sets of ordinals. 

In Seetion 6 we showed that foreing with eountable support produets 
of eopies of Saeks foreing over models of CH produces models of set 
theory in which many cardinal characteristics of the continuum, such as 
the cardinals in Cichoh’s diagram, are small (i.e., equal to Ki), while the 
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size of the continuum is big. However, there is one class of cardinal char- 
acteristics of the continuum whose members are still big in such models. 
Let X be a set and f : X ^ X. A point (t, y) G is covered by / 
iff f{x) = y or f{y) = x. A family T of functions from X to X covers 
A C X^ iff every point in A is covered by some member of T . 

One might ask how many continuous functions are needed to cover 

As it turns out, the number of continuous functions needed to cover 
is the the same as for [GesGoliKojcx)]. In the present context it 
is more convenient to consider 2‘^ instead of M. 

Let be a model of GCH and suppose that k > Ki is a cardinal of 
uncountable cofinality in V. Let P be the countable support product of k 
copies of Sacks forcing and let G be P-generic over V. For a < k let Xa 
denote the Sacks real added by the a-th factor of P, as above. Suppose 
that in y[G], X is a family of size < k of continuous functions from 2^^ 
to 2‘^. Since every continuous function from 2“^ to 2‘^ is in fact a closed 
subset of (2‘^)^, it can be coded (for example, using Borel codes) by a 
subset of u. Now it can be shown that there is a set A G V such that 
A is of size < k in V and T C V[{xa)a&A]- Let /9,7 G k\A. Since 
x^ ^ V[{xa)adAvj{i3}\ and xp ^ V[{xa)aaAvj{'^}\, no function from X 
covers {xp,x^). It follows that in V[G], T does not cover {2‘^Y. Since 
T was an arbitrary family of size < k of continuous functions on 2“^, 
we have shown that in V[G], (2*^)^ cannot be covered by less than n 
continuous functions. 

So, if we want to show that it is consistent that (2“^)^ can be covered 
by less than 2^° continuous functions, then we have to look for a different 
construction of models of set theory. We have to iterate Sacks forcing. 
The fundamental paper about iterated Sacks forcing is [BauLav79]. 

Definition 7.1. Let P be a partial order and let Q be a P-name for a 
partial order. Then P * Q is the partial order consisting of all pairs (p, q) 
such that p G P and g is a P-name for an element of Q. The set P * Q is 
ordered as follows: For (po, 7o), (Pi, 7i) G P * Q let (po, go) < (Pi, Qi) iff 
Po < Pi andpo Ih % < gi. 



There is an obvious embedding e : P — P * Q: Let 1q be a P-name 
for the largest element of Q. For every p G P let e(p) := (p, Iq). It 
is easily checked that e is a complete embedding. In particular, if G is 
P * Q-generic over the ground model V, then Go := e~^ [G] is P-generic 
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over V. In V[G] let Gi := {qco ■ e P((p, q) e G)}. Then Gi is Qg^- 
generie over V^[Go]. On the other hand, if Gq is P-generie over V and Gi 
is Qco^generie over V[Gq], then {{p,q) e F * Q : p e Go A qco e Gi} 
is P * Q-generie over V. 

This shows that foreing with P * Q is the same as first adding a P- 
generie filter Gq and then adding a Oco’S^iisrie filter over it. The point 
of iterated foreing is that we ean add generies over eaeh other using a 
single foreing notion in the ground model. The iteration beeomes really 
relevant only when we want to add infinitely many generies over eaeh 
other. 

Definition 7.2. Let <5 be an ordinal. A countable support iteration of 
length 5 is an objeet of the form ((Pq,)o< 5 , (Qa)a< 5 ) with the following 
properties: 

1- (Po)a <5 is a sequenee of foreing notions and Pq is the trivial foreing 
notion eontaining just one element, 

2. (Qa)a <5 is a sequenee sueh that for eaeh a < 5, is a P^-name 
for a foreing notion (more preeisely, a name for a set together with a 
name for an ordering on that set), 

3. for all a < 5, Pq,+i = Pq, * Qq, and 

4. if < (5 is a limit ordinal, then P^ eonsist of all funetions q with 
domain (3 sueh that 

(a) for all a < (3, q{a) is a Po-name for a eondition in Qq and q\a & 
Pq and 

(b) the support supt(g) := {« < /9 : Ip^ 1/ q{a) = of q is 
eountable (where is a Pa-name for the largest element of Qq) 

and P^ is ordered as follows: for p,q ^ Fp we have p < g iff for all 

a < (3,p\a Ihp^ p{a) < q{a). 

This definition deserves several remarks. First, it should be elear how 
one has to ehange the definition in order to obtain the definition ofyi- 
nite support iteration: Just replaee “eountably many” in 4b by “finitely 
many”. While finite supports are suitable for iterating e.e.e. foreings 
sinee finite support iterations of e.e.e. foreings are again e.e.e., eount- 
able supports are appropriate for Axiom A foreings sinee with eountable 
supports it is possible to extend the fusion teehnology used to deal with 
Axiom A foreings to the iteration. 
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A countable support iteration ((Po)q,< 5, (Qa)a<5) is already deter- 
mined by the Therefore, slightly abusing notation, we will frequently 
eall P5 the countable support iteration of the Q„, a < 5. For all a, P < S 
with a < P there is a natural embedding Cafs : Pa — P/j mapping every 
p G Pa to p' where p'\a = p and for all 7 G [a, P), pf'y) = The Ca/s 
are eomplete embeddings. Via Cas we may eonsider Pa as a subset of P^. 

Let G be a P^-generie filter over the ground model V. Then for every 
a < 6, Ga ■= G nFa is Pa-generie over V. Thus, we have an inereasing 
ehain 

^ c v[Gf c . . . c v[Ga] c . . . c V[G] 

of models of set theory. For every a < 5, V[G] is a generie extension 
of V[Ga], obtained by adding a Q-generie filter over V[GP\, where Q G 
V[Ga] is the so-ealled quotient 0/P5 over Ga- The partial order Q ean 
be eonstrueted in a natural way from ((Pa)a<5, (Qo)a<5) and G^ and is 
a eountable support iteration of length 5 — a}^ 

The countable support iteration of Sacks forcing of length 5 is the 
foreing notion §5 where ((Sa)o<3, (Qa)a<5) is a eountable support iter- 
ation sueh that for every a < 5, Qq is an §a-name for Saeks foreing. 
The Sacks model is a model of set theory of the form V[G] where G 
is §o;2‘gcaerie over V and V is a model of CH. The formulation “the 
Saeks model” is slightly misleading, sinee the model is not uniquely de- 
termined. However, §^2 "generie extensions of models of CH are suffi- 
eiently similar to eaeh other to be eonsidered the same most of the time. 

If the ground model satisfies CH, then has no antiehains of length 
K2 (cf [BauLav79] or, for a more general result about proper foreing, 
[She98, Chapter III, Theorem 4.1]) and therefore does not eollapse any 
eardinal > K2. Sinee eountable support iterations of Saeks foreing have 
the 2-loealization property, as we will show in a moment, they do not 
eollapse Ki. It follows that foreing with §^^2 over models of CH does not 
eollapse eardinals. Sinee §^2 has the 2-loealization property, it follows 
from Corollary 3.2 that all the eardinals in Ciehoh’s diagram are Ki in 
the Saeks model. 

If G is §5-generie over the ground model V and a < S, then the 
quotient Q of over G fl §a is (equivalent to) the eountable support 
iteration of Saeks foreing of length <5—0; in V[Ga\- Moreover, V[Ga+i] = 
V[Ga][Ta] where Xa is the Saeks real added by the last faetor of the 

Cf. [Goli93, Section 4] for more on quotient forcing. 
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iteration Sq,+i. Using this notation we see that for all f3 < S, V[Gfs] = 
y[(xa)a<i 3 ]- As in the case of countable support products, an easy density 
argument shows that the Xa are pairwise different. Analyzing -names 
for subsets of u one can see that the size of the continuum does not 
exceed K 2 in the Sacks model. It follows that the size of the continuum 
is exactly K 2 in the Sacks model. 

We already mentioned that countable support iterations of Sacks forc- 
ing cannot be used to construct models of set theory where > K 2 . 
This is because countable support iterations of nontrivial forcing notions 
of some length with cofinality Ki collapse the size of the continuum to 
Thus, if 5 is an ordinal with cofinality > Ki and G is S^-generic 
over the ground model V, then for cofinally many a < 5, U[Gq,] is a 
model of CH. It follows that 2^° cannot be bigger than M 2 in V[G], 

The Sacks model is generally known as the model of set theory in 
which all reasonable cardinal characteristics of the continuum (that can 
consistently be smaller than 2^°) are Mi. Recently this has been turned 
into a provable statement by Zapletal {cf. [Zap04]). For a large class of 
combinatorial cardinal characteristics of the continuum he showed that if 
there is a forcing extension of the ground model V in which the cardinal 
characteristic in question is smaller than 2^°, then this cardinal is Mi in 
V[G] where G is §( 2 ^ 0 )+ -generic over V. The model V[G] actually is 
“the” Sacks model. Even if V is not a model of CH, forcing with an 
initial part of S( 2 ‘<o)+» collapses the 2^° to Mi. Now the (2^°)+ of V is M 2 . 
The remaining part of the iteration is just forcing with §^2 ever a model 
of CH. 

The proof of the following theorem illustrates the main techniques 
for proving statements about the Sacks model. The theorem together with 
Lemma 7.4 summarizes a sizeable portion of what is known about this 
model of set theory. 

Theorem 7.3. In the Sacks model, (2*^)^ can be covered by Mi continu- 
ous functions from 2‘^ to 2“, while 2^° = M 2 . 

This theorem was explicitly shown in [Har2vSt02]. However, impli- 
citly it already follows from Miller’s [Milo83] about mapping sets of reals 
onto the reals and from the work of Groszek (cf. [GroiSl]) showing that 
the constructible degrees of reals in a § 0 ^ 2 -generic extension of L are 

Cf. [Goli93, Section 0] for more details. 
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wellordered of ordertype 0 J 2 . Constructible degrees of reals are defined 
as follows: For x,y & 2‘^ \e.i x < y iff x G L[i/]. The model L[y] can be 
constructed without forcing^^. A constructible degree is an equivalence 
class with respect to the relation < fl >. The order < induces an order 
on the constructible degrees, the constructibility order. Stronger versions 
of Theorem 7.3 can be found in [CiePawcx)], [Ges+02], and [StC 299 ]. 

There are more consistency results about the structure of constructible 
degrees of reals in models of set theory. Let us mention two examples. 

Groszek [Groi94] showed that it is consistent to have a reversed 
copy of oji as an initial segment of the order of constructible degrees. 
Kanovei and Zapletal [KaniZap98] showed that there is a generic exten- 
sion L[G] of L in which there are a strictly increasing sequence {an)n£uj 
of constructible degrees and a constructible degree b such that the fol- 
lowing holds: For two constructible degrees c, d let (c, d) denote the 
constructible degree which is the least upper bound of c and d. Then 
the sequence is strictly increasing and if t G L[G] is not 

an element of any intermediate model of the form L[ao, . . . , a„, 6 ], then 
L[G] = L[t]. 

We go back to the proof of Theorem 7.3. Since the number of ground 
model reals is Ki in the Sacks model, the theorem follows immediately 
from the next lemma, which gives some additional information. 

Lemma 7.4. a) Let 6 be an ordinal and suppose that G be S^-generic 
over the ground model V. For a < 5 let Gq, := G fl Sq,. Then in V[G] the 
following holds: Let t G 2“^ and let a < 5 be the first ordinal such that 
X G V[Ga\- Then for all 1 / G 2“^ fl V[Ga] there is a continuous function 
f : 2‘^ ^ 2‘^ coded in V such that f{x) = y. In particular, (2^^)^ is 
covered by the continuous functions coded in the ground model. 

b) For every ordinal 5, §<5 has the 2-localization property and there- 
fore has the Sacks property. 

Our proof of a) will be more or less the same as the proof of a similar 
statement in [GesGoliKojcx)], which talks about slightly different forcing 
notions. In order to find the required continuous functions in the ground 
model, we first observe that for 2 ‘^ an extension theorem similar to the 
Tietze-Urysohn Theorem holds. 
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Cf. [Jec78, Section 15]. 
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Claim 2. Whenever A is a elosed subset of 2‘^ and f : A ^ 2‘^ is eon- 
tinuous, then / ean be extended to a eontinuous funetion / : 2“ — 2‘^. 

This ean be seen as follows: First of all, it suffiees to extend every 
eontinuous g : A ^ 2to a eontinuous funetion g : 2‘^ ^ 2 sinee / is built 
from eountably many sueh funetions. \i g \ A ^ 2is eontinuous, then 
and are disjoint elosed subsets of 2‘^. Sinee the topology 

on 2“^ is generated by elopen sets and sinee 5 '“^( 0 ) and g~^{i) are in 
faet eompaet, there is a elopen set B C 2‘^ sueh that g~^{l) C B and 
17 ”^ (0) C 2‘^\B. Let ^ ; 2“^ — ^ 2 be the eharaeteristie funetion of B. 
Sinee B is elopen, g is eontinuous. By the ehoiee of B, g extends g. This 
finishes the proof of Claim 2. 

This extension property shows that it aetually suffiees to show that for 
X and y as in a) there are a elosed set A C 2“^ and a eontinuous funetion 
f \ A ^ 2^ sueh that / (and therefore A) is eoded in V and f(x) = y. 
Sinee we intend to work in V, we have to ehoose suitable names for x and 
y first. It is worth mentioning that no new reals are added at limit stages 
of foreing iterations of uneountable eofinality. That is, if a is minimal 
with X G V[Ga], then a is of eountable eofinality 

Claim 3. Let x be an S^-name sueh that there is no /3 < a with xq ^ G 
V[Gi 3 \. Then there is an S^-name i sueh that xq^ = and for a\\ (3 < a 
and all Sa-generie filters H over V, zh ^ ^[Hy] where Hy \= H n'Bg. 

We first show that there is p G sueh that for no §Q-generie filter H 
over V with p e H there is (3 < a sueh that x G V[Hg\. Note that every 
§/ 3 -name, (3 < a, eorresponds eanonieally to some §a-name for the same 
objeet. We may therefore identify every S/^-name with the eorresponding 
§a-name. Now eonsider the sets 

Dq := {p e Sa ■ For no S^-generie filter H over V 

with p e H there is (3 < a sueh that xh G VlHy]} 

and 

Di := {p G : There are /9 < a 

and an §^-name i sueh that p\\- x = z}. 
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Dq U Di is dense in Suppose that p G is not in Dq. Then there are 
an -generic filter H, (3 < a, and an §^-name i such that p e H and 
xh = ZHg- Since xh = zh^ there is some q & H which forces x = z. 
Since H is a filter, we may assume q < p. Clearly, q E Di. This shows 
thedensityofDoU-Di-By thegenericityofG, thereisp G Gn(Zi)oU-Di). 
If p G Di, then a is not minimal with x G V[Ga\- This shows p G Di. 
Therefore p is as required. 

Using the Maximality Principle it is now easy to construct an 
name i for an element of 2‘^ such that p Ih i = i; and for no Sq, -generic 
filter H over V there is (3 < a such that zh G V[Hi 3 ]}^ This finishes the 
proof of Claim 3. 

Using Claim 3 we can find an §Q,-name x for an element of 2‘^ such 
that X = XGc for all §Q,-generic filters H and all f3 < a,xu ^ 

In other words, T is a name for a real not added before stage a of the 
iteration. 

For p G 2^^ n U[Go] there is an Sa-name y for an element of 2^^ 
such that y = ■ For every p G we will construct q < p with the 

following property: 



LetTq (T) be the tree of g-possibilities for x. Then in V there 
i*)q,x,y is a continuous map / : [Tq{x)] 2‘^ such that if H is Sq,- 

generic over V with q G H, then / maps xh to yn- 

Clearly, q forces f{x) = y. The function / is only defined on a closed 
subset of 2“^, but by Claim 2 we can extend it (still in the ground model) 
to a continuous function g : 2‘^ ^ 2‘^. 

Thus, the set of conditions forcing g{x) = y for some continuous 
ground model function p : 2“ — 2“ is dense in §q. This shows that for 
every S^-generic filter G there is a continuous ground model function 
p : 2‘^ — 2“^ such that p(Tg) = Vg- 

This finishes the proof of part a) of Lemma 7.4 provided we know 

Lemma 7.5. Let a be an ordinal and x an §„-name for an element of 2^^ 
which is not added in a proper initial stage of the iteration. Let y be an 
§Q,-name for an element of 2“^. Then for every p G Sq there is p < p such 
that {*)q,x,y holds. 

Choose an So,-name y for an element of 2“ which is not added before stage a. Let z be such 
that p\\- z = X and q\\- z = y for all q G Ea that are incompatible with p. 
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Proof. The proof of this lemma is elosely related to the proof of the 2- 
loealization property of eountable support produets of Saeks foreing in 
Seetion 6. We have to define the notion of a fusion sequenee for iterations 
of Saeks foreing. 

Let F be a finite subset of a and r] : F ^ u. For p,q G Sq, let 
P <F,r? q if and only if for all 7 G F, pfy Ih p{j) q{l)- 

A sequenee (pn)neoj is a fusion sequence if there are an inereasing 
sequenee (F„)„g^ of finite subsets of a and a sequenee {rjn)n&uj sueh that 
for all n e u, T]n Fn ^ u, for all i e Fn we have Pnii) < Vn+iii), 
Pn+i Pn, and for all 7 G supt(p„) there is m G a; sueh that 

7 G F^ and 17^(7) > n. 

The fusion of a fusion sequenee {pn)n&uj in is defined as fol- 
lows: Let 7 < a and suppose we have already defined p^ fy. Let p^{'y) 
be an §^-name for a eondition in S sueh that p^ fy forees Puli'!) to be the 
interseetion of the Pni'y), n e u. 

We also use a notion of faithfulness. For F and 7 as above, p G Sq,, 
and a G ri7eF let p * a G be sueh that for all 7 G F, 

p * o'l'y Ih p * (7(7) = p(y) * (7(7) 

and for 7 G a\F, p * <7(7) = p(y). 

A eondition p G §„ is (F, p) -faithful if for all <7, r G ri7eF with 
(J f T, Xp^a and Xp^r, the longest initial segments of x deeided by p * < 7 , 
respeetively by p * r, are ineomparable (with respeet to C)^*’. 

The following elaim is the iteration version of Claim 1 . 

Claim 4 . Let F and p be as before and suppose that g G §„ is (F, p)- 
faithful. 

a) Let f3 G a\F and let F' := F U {j3} and p' ■= pU {{j3, 0 )}. Then 
there is r <p^p q sueh that r is (F', p')-faithful. 

b) Let 13 & F and let p' := p\F\{/3} U {i/3,p{/3) + 1 )}. Then there is 
T <F,ri q sueh that r is (F, p')-faithful. 

Part a) follows immediately from the definitions. For part b) let <5 : = 
max F and let {<7 o, . . . , am} be an enumeration of ri7eF 2^(7) _ We define 
a <F,r;-deereasing sequenee {qj)j<m in Sq along with names and g^r^i, 
a G n.ygp’ for eonditions. 

Compare this to the corresponding definitions in Section 6. 
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Let j G {1, . . . m} and assume that gj_i has been constructed already. 
Since x is not added in a proper initial stage of the iteration, there are Qo-jA 
and q^.^i such that for i G 2, we have 

Qj-i * aj\5 Ih q^.^i < {q{5) * a) 



and 

qj-i * aj |'(5 Ih g and are incomparable. 

Let qj <F,ri Qj-i be such that qj * afti decides Xq^, g and This 

finishes the inductive construction of the qj. 

Now let r q^ be such that r = qjn \6 and for all a G ri7eF 
and all coordinatewise extensions r G offf, 

r * rfi Ih r * Tt[i5,Q:) = q^.rUD))- 

It is easy to check that r works for Claim 4. 

To conclude the proof of Lemma 7.5, let p, x and i) be as in the 
Lemma. Using some bookkeeping and parts a) and b) of Claim 4 we 
construct a sequence (pn)nsa; and a sequence (F„, r]n)naui witnessing that 
(Pn)n&uj is a fusion sequence such that p = po and for all n G a;, p„ is 
(F„, ?7„) -faithful. We can construct the sequences {pn)n&uj and (F„, Pn)n&u> 
with the additional property that for every n G a; and for every a G 
Pn * o- decides y \n. 

Let q be the fusion of the sequence {pn)n&uj- We have to check that 
i*)q,x,y holds. Let a G [Tq{x)] and n e u. Now q <Fn,rt„ Pn and p„ is 
{Fn, p„) -faithful. It follows that there is exactly one aa,n G ri7eF„ 
such that ^ a. Let /(a) ;= [Jneujyq*^a,n- Since for all n e u, 

q * (^a,n < Pn* (^a,n and p„ * aa,n dccidcs y\u, f{a) G 2‘^. It is easily 
checked that / : [Tq{x)] — > 2“^ is a continuous map witnessing 

q.e.d. 

Now we turn the proof of part b) of Lemma 7.4. Let G be S^-generic 
over the ground model V. By Claim 3, for x E there is a < 5 and an 
§Q,-name for an element of such that x = xq^ and for all Sa-generic 
filters 77 over V there is no /9 < a such ihdXxH E U[77^]. 

Using the proof of Lemma 7.5 for the name x for an element of 
instead of 2^^ (and any S^-name i) for an element of 2“^), for every p ESa 
we obtain a condition q < p such that {*)q,x,y is satisfied. This is because 
the proof of Lemma 7.5 does not depend on the fact that T is a name for 
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an element of 2‘^. It works for names for elements of as well. From 
the eonstruetion of q it follows that Tq{x) is binary. Clearly, q forees x to 
be a braneh through Tq{x). By the generieity of Ga, there is g G sueh 
that Tq{x) is binary. This shows the 2-loealization property of S5. 

Note that the name y is aetually not needed in the proof of the 2- 
loealization property. It is only used in order to be eonsistent with the 
notation of Lemma 7.5. q.e.d. (Lemma 7.4) 

Lemma 7.4 shows how S ean eollapse eardinals: Let G be 
over the ground model V. Suppose that V satisfies CH. Then in V[G] 
the size of the eontinuum is K2. Let H be §-generie over V[G]. Then 
L[(j][i7] is obtained by foreing with an iteration of Saeks foreing of 
length 012 + 1 over V. For 7 < (012)'^ + 1 let x^ denote the generie 
real added at stage 7. Here (012)'^ denotes the ordinal that is 012 in V. 
By Lemma 7.4, for every 7 < (012)^ there is a eontinuous funetion 
^ 2‘^ sueh that e V and f'y{x(^i^^^v) = x^. Sinee there are 
only Ml eontinuous funetion from 2"^ to 2“^ in V, this implies (012) < Ki 

in L[G][77]. This shows that adding a Saeks real over the Saeks model 
eollapses M2- 

At the end of this seetion let us mention another deseription of 
whieh is related more elosely to the representation of Saeks foreing as 
the partial order of uneountable Borel subsets of M. 

Definition 7.6. Let 5 be an ordinal. A set 5 C is cthl-perfect iff 

1. for every ordinal (3 < 5 and every sequenee s & B\j3 \= {u\!3 \ u & 
B} the set {r G M : s"'r G 5 f/S + 1} is uneountable and 

2. for every inereasing sequenee (/9n)nei.j of ordinals below 5 and every 
inereasing sequenee (with respeet to inelusion) {sn)n&oj sueh that for 
all n e uj, Sn e B\pnitis the ease that Sn e 5 ( 

It ean be shown that the partial order eonsisting of all ctbl-perfeet 
Borel subsets of is foreing equivalent to S5 (cf. [Zap04, 3.1]). This 
way of representing iterations of Saeks foreing was used by Kanovei 
[Kaui99] to iterate Saeks foreing along non-wellfounded linear orders. 
He showed that whenever / is a linearly ordered index set, it is possible 
to add a sequenee (aj)ig/ of reals to the ground model V sueh that for 
eaeh j G I, aj is a Saeks real over V[{ai)i^j], 
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Using the representation of in terms of ctbl-perfect Borel sets, we 
can give the formulation of a set theoretic axiom that axiomatizes the 
properties of the Sacks model very well. 

Definition 7.7. Consider the following game between player I and player 
II lasting rounds. At round /3 G Ui player I plays an ordinal < uJi, 
a ctbl-perfect Borel set C and a Borel function M. 

Then player II responds by a ctbl-perfect Borel set Cf 3 C Bp. Player I 
wins iff U/3<o;i //?[^/3] = The Covering Property Axiom (CPAj is the 
statement “CH fails and player II has no winning strategy in the above 
game”. 

CPA was invented by Ciesielski and Pawlikowski [CiePawcx)]. They 
showed that CPA holds in the Sacks model and implies many statements 
that were previously known to be true in the Sacks model. In fact, many 
of these statements already follow from weaker versions of CPA which 
are also defined in [CiePawcx)] and which seem to be much easier to ap- 
ply. In particular, these weaker version of CPA avoid some of the techni- 
calities and notational inconveniences usually associated with countable 
support iterations. 

Recently, Zapletal [Zap04] proved that if V satisfies CPA, then for a 
large class of cardinal characteristics of the reals the following holds: If 
there is a generic extension V [G] of V where the cardinal characteristic 
in question is < 2^°, then this cardinal characteristic is < in V. 
This shows that CPA indeed axiomatizes the Sacks model very well. The 
formulation of CPA is sufficiently general to be easily adapted for other 
models of set theory as welP’. 
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Cf. [CiePawcx)] and [Zap04] for this. 
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1 Supertasks 

What would you compute with an infinitely fast computer? What could 
you compute? To make sense of these questions, one would want to un- 
derstand the algorithms that the machines would carry out, computational 
tasks involving infinitely many steps of computation. Such tasks, known 
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as supertasks, have been studied since antiquity from a variety of view- 
points. 

Zeno of Elea (ca. 450 B.C.) was perhaps the first to grapple with 
supertasks, in his famous paradox that it is impossible to go from here 
to there, because before doing so one must first get halfway there, and 
before that halfway to the halfway point, and so on, ad infinitum. Zeno 
takes the impossibility of completing a supertask as the foundation of his 
reductio. More recently, twentieth century philosophers (cf. [Thoi54]) 
have introduced Thomson’s lamp, which is on for | minute, off for | 
minute, on for ^ minute, and so on. After one minute, is it on or off? 

o 

In a more intriguing example, let’s suppose that you have infinitely 
many one dollar bills (numbered 1, 3, 5, . . .) and in some nefarious un- 
derground bar, the Devil explains to you that he has an attachment to your 
particular bills, and is willing to pay you two dollars for each of your one 
dollar bills. To carry out the exchange, he proposes an infinite series of 
transactions, in each of which he will hand over to you two dollars and 
take from you one dollar. The first transaction will take ^ hour, the sec- 
ond I hour, the third | hour, and so on, so that after one hour the entire 
exchange is complete. Should you accept his proposal? Perhaps you will 
become richer? At the very least, you think, it will do no harm, and so 
the contract is signed and the procedure begins. 

It appears initially that you have made a good bargain, because at 
every step of the transaction, you receive two dollars but give up only one. 
The Devil is particular, however, about the order in which the bills are 
exchanged: he always buys from you your lowest-numbered bill, paying 
you with higher-numbered bills. (So on the first transaction he accepts 
from you bill number 1, and pays you with bills numbered 2 and 4, and 
on the second transaction he buys from you bill number 2, which he had 
just paid you, and pays you bills numbered 6 and 8, and so on.) When 
the transaction is complete, you discover that you have no money left 
at all! The reason is that at the nth exchange, the Devil took from you 
bill number n, and never subsequently returned it to you. Thus, the final 
destination of every individual bill is under the ownership of that shrewd 
banker, the Devil. 

The point is that you should have paid more attention to the details of 
the supertask transaction that you had agreed to undertake. And similarly, 
when we design supertask algorithms to solve mathematical questions. 
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we must take eare not to make inadvertent assumptions about what may 
be true only for finite algorithms. 

Supertasks have also been studied by the physieists (cf. [Ear95]). Us- 
ing only the Newtonian gravity law (and negleeting relativity), it is pos- 
sible to arrange finitely many stars in orbiting pairs, eaeh pair orbiting 
the eommon eenter of mass of all the pairs, and a single tiny moon raeing 
faster around, squeezing just so between the dual stars so as to piek up 
speed with every sueh transaetion. Assuming point masses (or eollaps- 
ing stars to avoid eollision), the arrangement leads by Newton’s law of 
gravitation to infinite aeeeleration in finite time. Other supertasks reveal 
apparent violations of the eonservation of energy in Newtonian physies: 
infinitely many billiard balls, of sueeessively diminishing size eonverg- 
ing to a point, are initially at rest, but then the first is set rolling, and 
eaeh ball transfers in turn all the energy to the next; after a finite amount 
of time, all motion has eeased, though every interaetion is energy eon- 
serving. Still other arrangements have the balls spaeed out further and 
further out to infinity, and the interesting thing about both of these ex- 
amples is that time-symmetry allows them to run in reverse, with statie 
eonfigurations of balls suddenly eoming into motion without violating 
eonservation of energy in any interaetion. 

More eomputationally signifieant supertasks have been proposed by 
physieists in the eontext of relativity theory (cf. [EarNori93], [Hogo92], 
[Hogo94]). Suppose that you want to know the answer to some number 
theoretie eonjeeture, sueh as whether there are additional Eermat primes 
(primes of the form 2^" -f 1), a eonjeeture that ean be eonfirmed with 
a single numerieal example. The way to solve the problem is to board 
a roeket, while setting your graduate students to work on earth looking 
for an example. While you fly faster and faster around the earth, your 
graduate students, and their graduate students and so on, eontinue the 
exhaustive seareh, with the agreement that if they ever find an example, 
they will send a radio signal up to the roeket. The point is that mean- 
while, by aeeelerating suffieiently fast towards the speed of light, it is 
possible to arrange that beeause of relativistie time eontraetion, what is 
a finite amount of time on the roeket eorresponds to an infinite amount 
of time on the earth. The general observation is that by means of sueh 
eommunieation between two referenee frames, what eorresponds to an 
infinite seareh ean be eompleted in a finite amount of time. 
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Even more complicated arrangements, with rockets flying around 
rockets, can be arranged to solve more complicated number theoretic 
questions. And more complicated relativistic spacetimes can be (mathe- 
matically) constructed to avoid the unpleasantness of infinite acceleration 
required in the rocket examples above {cf. [Piti90]). 

These computational examples speak to the generalized Church’s the- 
sis, the widely accepted philosophical principle that the classical theory 
of computability has correctly captured the notion of what it means to 
be computable. Because the relativistic rocket examples provide algo- 
rithms for computing functions, such as the halting problem, that are not 
computable by Turing machines, one can view them as refuting the gen- 
eralized Church’s thesis. Supporters of this view emphasize that when 
thinking about what is in principle computable, we must attend to the 
computational power available to us as a consequence of the fact that 
we live in a relativistic or quantum-mechanical universe. To ignore this 
power is to pretend that we live in a Newtonian world. Another (proba- 
bly specious) argument against the generalized Church’s thesis consists 
of the observation that a particle undergoing Brownian motion can be 
used to generate a random bit stream that we have no reason to think is 
recursive. Therefore, proponents argue, we have no reason to believe the 
generalized Church’s thesis. 

Apart from the question of what one can actually compute in this 
world, whether Newtonian or relativistic or quantum-mechanical, math- 
ematicians are interested in what in principle a supertask can accom- 
plish. Buchi [Biic62] and others initiated the study of cu-automata and 
Buchi machines, involving automata and Turing machine computations 
of length oj which accept or reject infinite input. Moving to a higher level 
in the hierarchy, Gerald Sacks and many others {cf. [Sac90]) founded the 
field of higher recursion theory, including a-recursion and i? -recursion, 
a huge body of work analyzing computation on infinite objects. Blum, 
Shub and Smale in [BluShuSma89] have presented a model of compu- 
tation on the real numbers, a kind of flowchart machine where the basic 
units of computation consist of real numbers, in full glorious precision. 
Apart from this previous mathematical work, I would like to propose here 
a new model of infinitary computability: infinite time Turing machines. 
This model offers the strong computational power of higher recursion 
theory while remaining very close in spirit to the computability concept 
of ordinary Turing machines. 
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2 Infinite time Turing machines 

I propose to extend the Turing maehine eoneept to transfinite ordinal 
time, thereby providing a natural model for infinitary eomputability.^ The 
idea is to allow somehow a Turing maehine to eompute for infinitely 
many steps, while preserving the information produeed up to that point. 

So let me explain speeifieally how the maehines work. The maehine 
hardware is identieal to a elassieal Turing maehine, with a head moving 
baek and forth reading and writing zeros and ones on a tape aeeording to 
the rigid instruetions of a finite program, with finitely many states. What 
is new is the transfinite behavior of the maehine, behavior providing a 
natural theory of eomputation on the reals that direetly generalizes the 
elassieal finite theory to the transfinite. For eonvenienee, the maehines 
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Fig. 1. An infinite time Turing machine: the computation begins 



have three tapes -one for the input, one for serateh work and one for 
the output- and the eomputation begins with the input written out on 
the input tape, with the head on the left-most eell in the start state. The 
sueeessor steps of eomputation proeeed in exaetly the elassieal manner: 
the head reads the eontents of the eells on whieh it rests, refieets on its 
state and follows the rigid instruetions of the finite program it is running: 
aeeordingly, it writes on the tape, moves the head one eell to the left or 
the right or not at all and switehes to a new state. Thus, the elassieal 
proeedure determines the eonfiguration of the maehine at stage a + 1, 
given the eonfiguration at any stage a. 



' Infinite time Turing machines were originally defined by Jeff Kidder in 1990, and he and 
I worked out the early theory together while we were graduate students at UC Berkeley. 
Later, Andy Lewis and I solved some of the early questions, and presented a complete 
introduction in [HamLewOO], later solving Post’s problem for supertasks in [HamLew02]. 
Benedikt Lowe in [LdwOl], Dan Seabold in [HamSeaOl] and especially Philip Welch in 
[Welo99], [WeloOOb], [WeloOOa] have also made important contributions. 





146 



JOEL DAVID HAMKINS 



We extend the eomputation into transfinite ordinal time by simply 
speeifying the behavior of the maehine at limit ordinals. When a elas- 
sieal Turing maehine fails to halt, it is usually thought of as some sort 
of failure; the result is disearded even though the maehine might have 
been writing some very interesting information on the tape (sueh as all 
the theorems of mathematies, for example, or the members of some other 
eomputably enumerable set). With infinite time Turing maehines, how- 
ever, we hope to preserve this information by taking some kind of limit 
of the earlier eonfigurations and eontinuing the eomputation transfinitely. 
Speeifieally, at any limit ordinal stage A, the head resets to the left-most 
eell; the maehine is plaeed in the speeial limit state, just another of the 
finitely many states; and the values in the eells of the tape are updated by 
eomputing the lim sup of the previous eell values. With the limit stage 
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Fig. 2. The limit configuration 



eonfiguration thus eompletely speeified, the maehine simply eontinues 
eomputing. If after some amount of time the halt state is reaehed, the 
maehine gives as output whatever is written on the output tape. 

Beeause there seems to be no need to limit ourselves to finite input 
and output -the maehines have plenty of time to eonsult the entire input 
tape and to write on the entire output tape before halting- the natural 
eontext for these maehines is Cantor Spaee 2“^, the spaee of infinite binary 
sequenees. For our purposes here, let’s denote this spaee by M and refer 
to its members as real numbers, intending by this terminology to mean 
infinite binary sequenees. We regard the set of natural numbers N as a 
subset of M by identifying the number 0 with the sequenee ( 000 . . . ), the 
number 1 with ( 100 . . . ), the number 2 with ( 110 . . . ), and so on. 

Beeause every program p determines a funetion -the funetion send- 
ing input X to the output of the eomputation of program p on input x- the 
maehines provide a model of eomputation on the reals. We define that a 
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partial function / • M — > M is infinite time computable (or supertask com- 
putable, or for brevity, just computable, when the infinite time eontext is 
understood) when there is a program p sueh that f{x) = y if and only if 
the eomputation of program p on input x yields output y. A set of reals 
A C M is infinite time decidable (or supertask decidable or again, just 
decidable) when its eharaeteristie funetion, the funetion with value 1 for 
inputs in A and 0 for inputs not in A, is eomputable. The set A is infinite 
time semi-decidable when the funetion of affirmative values 1 (A, i.e., the 
funetion with domain A and eonstant value 1, is eomputable. (Thus, the 
semi-deeidable sets eorrespond in the elassieal theory to the reeursively 
enumerable sets, though sinee here we have sets of reals, we hesitate to 
deseribe them as enumerable.) Sinee it is an easy matter to ehange any 
output value to 1, the semi-deeidable sets are exaetly the domains of the 
eomputable funetions, just as in the elassieal theory. 

Theorem 2.1. Every supertask eomputation halts or repeats in eountably 
many steps. 

Proof. Suppose that a supertask eomputation does not halt by any eount- 
able stage of eomputation. The point is now that a simple eofinality ar- 
gument shows that the eomplete eonfiguration of the maehine at stage cci 
-the position of the head, the state and the eontents of the eells- must 
have oeeurred earlier. For example, one ean find a eountable ordinal ao 
by whieh time all of the eells that have stabilized by oj\ have stabilized. 
And then one ean eonstruet a eountable inereasing sequenee of eountable 
ordinals ao < ai < . . . sueh that all the eells that ehange their value after 
an do so at least onee between and an+i - These ordinals exist beeause 
oji is regular and there are only eountably many eells. At the limit stage 

= sup an, whieh is still a eountable ordinal, I elaim that the eonfigu- 
ration is the same as at cci : sinee it is a limit ordinal, the head is on the first 
eell and in the limit state; and by eonstruetion the eontents of eaeh eell 
are eomputing the same limsup that they eompute at cci. Sinee beyond 
ao the only eells that ehange are the ones that will ehange unboundedly 
often, it follows that limits of this eonfiguration are the very same eonfig- 
uration again, and the maehine is eaught in an endlessly repeating loop. 
So the proof is eomplete. q.e.d. 

Please observe in this argument that, eontrary to the elassieal situa- 
tion, a eomputation that merely repeats a eomplete maehine eonfiguration 
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need not be caught in an endlessly repeating loop. After oj many repeti- 
tions, the limit configuration may allow it to escape. One example of this 
phenomenon would be the machine which does nothing at all except halt 
when it is in the limit state; this machine repeats its initial configuration 
many times, yet still halts at u. 

3 How powerful are the machines? 

One naturally wants to understand the power of the new machines. The 
first observation, of course, is that the classical halting problem for ordi- 
nary Turing machines -the question of whether a given program p halts 
on given input n in finitely many steps- is decidable in oj many steps 
by an infinite time Turing machine. To see this, one programs an infinite 
time Turing machine to simply simulate the operation of p on n, and if 
the simulated computation ever halts our algorithm gives the output that 
yes, indeed, the computation did halt. Otherwise, the limit state will be 
attained, and when this occurs the machine will that know the simulated 
computation failed to halt; so it outputs the answer that no, the computa- 
tion did not halt. 

The power of infinite time Turing machines, though, far transcends 
the classical halting problem. The truth is that any question of first or- 
der number theory is supertask decidable. With an infinite time Turing 
machine, one could solve the prime pairs conjecture (which asserts that 
there are infinitely many primes pairs, pairs of primes differing by two), 
for example, and the question of whether there are infinitely many Fermat 
primes (primes of the form 2^" -f 1) and so on: there is a general decision 
algorithm for any such conjecture. The point is that to decide a question 
of the form 3n<p(n, x), where n ranges over the natural numbers, one 
can simply try out all the possible values of n in turn. One either finds a 
witness n or else knows at the limit that there is no such witness, and in 
this way decides whether 3n<p(n, x). Iterating this idea, one concludes 
by induction on the complexity of the statement that any first order num- 
ber theoretic question is decidable with only a finite number of limits, 
i.e., before stage In fact, the class of sets that are decidable in time 

uniformly before is exactly the class of arithmetic sets, the sets of 

reals that are definable by a statement using quantifiers over the natural 
numbers {cf. [HamLewOO, Theorem 2.6 ]). 
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Theorem 3.1. Arithmetic truth is infinite time decidable. 

One can push this much harder to see that even more complex questions, 
questions from the lower part of the projective hierarchy in second order 
number theory, are supertask decidable. The fact is that any II j set is de- 
cidable and more. To prove this, it suffices to consider the most complex 
nj set, the well-known set wo, consisting of the reals coding a well- 
orders of a subset of N. An infinite binary sequence x codes a relation < 
on N when i <\ j if and only if x{{i, j)) = 1, where ( ■, ■ ) is the Godel 
pairing function coding pairs of natural numbers with natural numbers. 

Theorem 3.2. The set wo is infinite time decidable. 

Proof. This argument is known as the “count-through” argument. We 
would like to describe a supertask algorithm which on input x decides 
whether x codes a well order < on a subset of N or not. In u many steps, 
it is easy to check whether x codes a linear order: this amounts merely 
to checking that the relation < coded by x is transitive, irrefiexive and 
connected. For example, the machine must check that whenever i <\ j 
and j < k then also i < k, and all these requirements can be enumerated 
and checked in u many steps. 

Next, the algorithm will attempt to find the least element in the field 
of the relation <. This can be done by keeping a current-best-guess on 
the scratch tape and systematically looking for better guesses, whenever 
a new smaller element is found. When such a better guess is found, it 
replaces the current guess on the scratch tape, and a special flag cell is 
flashed on and then off again. At the limit, if the flag is on, it means that 
infinitely often the guess was changed, and so the relation has an infinite 
descending sequence. Thus, in this case the input is definitely not a well 
order and the computation can halt with a negative output. Conversely, 
if the flag is off, it means that the guess was only changed finitely often, 
and the machine has successfully found the < least element. The algo- 
rithm now proceeds to erase all mention of this element from the field 
of the relation <. This produces a new smaller relation, and the algo- 
rithm proceeds to find the least element of it. In this way, the relation 
< is eventually erased from the bottom as the computation proceeds. If 
the relation is not a well order, eventually the algorithm will erase the 
well founded initial segment of it, and then discover that there is no least 
element remaining, and reject the input. If the relation is a well order. 
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then the algorithm will eventually erase the entire field, and recognize 
that it has done so, and accept the input as a well order. This completes 
the proof. q.e.d. 

Since WO is well-known as a complete Ilj set, we conclude as a corol- 
lary that every II J set is infinite time decidable and hence also, every Sj 
set is infinite time decidable. But one can’t go much further in the projec- 
tive hierarchy, because every semi-decidable set has complexity A^. For 
a finer stratification, let me mention that the arithmetic sets are exactly 
the sets which can be decided by an algorithm using a bounded finite 
number of limits, and the hyperarithmetic sets, the A j sets, are exactly 
the sets which can be decided in some bounded recursive ordinal length 
of time. Thus, the arithmetic sets are those that can be decided uniformly 
in time before and the hyperarithmetic sets are exactly those which 
can be decided uniformly in time before 

Much of the classical computability theory generalizes to the super- 
task context of infinite time Turing machines. For example, the s-m-n 
theorem and the Recursion Theorem go through with virtually identical 
proofs. But some other classical results, even very elementary ones, do 
not generalize. One surprising result, for example, is the following. 

Theorem 3.3. There is a non-computable function whose graph is semi- 
decidable. 

This follows from what I have called the Lost Melody Theorem 
[HamLewOO, Theorem 4.9], which asserts the existence of a real c such 
that { c } is decidable, but c is not writable. Imagine the real c as the 
melody that you can recognize when someone sings it, but you cannot 
sing it on your own. Using such a lost melody real c, one can prove The- 
orem 3.3 with the function f{x) = c. Indeed, since this function is con- 
stant and the graph is decidable, the theorem can be strengthened to the 
assertion that there is a non-computable constant function whose graph is 
decidable. To give some idea of how one proves the Lost Melody Theo- 
rem, let me mention that the real c will be the least real in the Godel con- 
structible universe L hierarchy that codes the ordinal supremum of the 
places where all computations on input 0 have either halted or repeated. 
Since this ordinal is above every writable ordinal, the real c cannot be 
writable. But the real c codes enough information about itself so that an 
infinite time Turing machine can verify that a given real is c or not. 
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4 How long do the computations take? 

One naturally wants to understand how long a supertask computation 
can take. Therefore, I define an ordinal a to be dockable if there is a 
computation on input 0 that takes exactly a many steps to complete (so 
that the oth step of computation is the act of moving to the halt state). 
Such a computation is a clock of sorts, a way to count exactly up to a. 

It is very easy to see that any finite n is dockable; one can simply 
have a machine cycle through n states and then halt. The ordinal u is 
dockable, by the machine that halts whenever it sees the limit state. And 
these same ideas show that if a is dockable, then so is a + n and a-\- u. 
Thus, every ordinal up to is dockable. The ordinal itself is dock- 
able: one can recognize it as the first limit of limit ordinals, by flashing a 
flag on and then off again every time the limit state is encountered. The 
ordinal will be first time this flag is on at a limit stage. Going beyond 
this, it is easy to see that if a and j3 are dockable, so are a -t- /9 and af3. 

Undergraduate students might enjoy finding algorithms to dock specific 

2 

ordinals, such as , and I can recommend this as a way to help them 
understand the ordinals more deeply. 

Most readers will have guessed that the analysis extends much fur- 
ther. In fact, any recursive ordinal is dockable. This can be seen by op- 
timizing the count-through argument in Theorem 3.2. Specifically, after 
writing a real coding a recursive ordinal on the tape in u many steps, 
one proceeds to count through it in an optimized fashion. Rather than 
merely guessing the least element of the relation, one guesses the oj many 
least elements of the relation (while simultaneously erasing the previous 
guesses). In this way, each block of u many steps of the algorithm will 
erase u many elements from the field of the relation. 

Some have been surprised that the dockable ordinals extend beyond 
the recursive ordinals, but in fact they extend well beyond the recursive 
ordinals. To see at least the beginnings of this, let me show that the ordi- 
nal -f ci; is dockable, where is the supremum of the recursive 
ordinals. Kleene has proved that there is a recursive relation whose well- 
founded part has order type Consider the supertask algorithm that 
writes this relation on the tape and then attempts to count through it. By 
stage cuf ^ the ill-founded part will have been reached, but it takes the al- 
gorithm an additional u many steps to realize this. So it can halt at stage 
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One is left to wonder, is itself eloekable? More generally, Are 
there gaps in the dockable ordinals? After all, if a ehild ean eount to 
twenty-seven, then one might expeet the ehild also to be able to eount 
to any smaller number, sueh as nineteen^. The question is whether we 
expeet the same to be true for infinite time Turing maehines. 

Theorem 4.1. Gaps exist in the eloekable ordinals. 

Proof. Consider the algorithm whieh simulates all programs on input 0, 
reeording whieh have halted. When a stage is found at whieh no pro- 
grams halt, then halt. This produees a eloekable ordinal above a non- 
eloekable ordinal, so gaps exist. q.e.d. 

The argument ean be modified to show that the next gap above any 
eloekable ordinal has size u. Other arguments establish that eomplieated 
behavior ean oeeur at limits of gaps, beeause the lengths of the gaps are 
unbounded in the eloekable ordinals. 

Question 1. What is the strueture of the eloekable ordinals? 

For example, one might wonder whether the first gap begins at the 
supremum of the reeursive ordinals? (It does, sinee no admissible ordinal 
is eloekable [HamLewOO].) 

There is another way for infinite time Turing maehines to operate as 
eloeks, and this is by eounting through a real eoding a well order in the 
manner of Theorem 3.2. To assist with this analysis, we define that a real 
is writable if it is the output of a supertask eomputation on input 0. An 
ordinal is writable if it is eoded by a writable real. It is easy to see that 
there are no gaps in the writable ordinals, beeause if one ean write down 
real eoding a, it is an easy matter to write down from this a real eoding 
any partieular (3 < a. In [HamLewOO], Andy Lewis and I proved that the 
order types of the eloekable and writable ordinals are the same, but the 
question was left open as to whether these two elasses of ordinals had the 
same supremum. This was solved by Philip Weleh in [WeloOOa], allowing 
Andy Lewis and I to greatly simplify arguments in [HamLew02]. 

^ Friends with children have informed me that such an expectation is unwarranted; one some- 
times can’t get the child to stop at the right time. This reminds me of a time when my younger 
brother was in kindergarten, the children all sat in a big circle taking turns saying the next 
letter of the alphabet: A, B, C, and so on, around the circle in the manner of the usual song. 
After the letter K, the next child contributed LMNOP, thinking that this was only one letter. 
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Theorem 4.2 (Welch). Every dockable ordinal is •writable. The supre- 
mum of the writable and dockable ordinals is the same. 

5 The supertask halting problems 

Any notion of computation naturally provides a corresponding halting 
problem, the question of whether a given computation will halt. In the 
supertask context, we divide the halting problem into two parts, a bold- 
face and a lightface problem: 

H = {{p,x) I program p halts on input x } 

h = {p \ program p halts on input 0 } 

In the classical theory, of course, these two sets are Turing equivalent, 
but here the situation is different. Nevertheless, for undecidability the 
classical arguments do directly generalize. 

Theorem 5.1. The halting problems h and H are semi-decidable but not 
decidable. 

For semi-decidability, the point is that given a program p and input x 
(or input 0), one can simply simulate p on a; to see if it halts. If it does, 
output the answer that yes, it halted; otherwise, keep simulating. For un- 
decidability, in the case of H one can use the classical diagonalization 
argument; for the lightface halting problem h, one appeals to the Recur- 
sion Theorem, just as in the classical theory. 

6 Oracles 

There are two natural types of oracles to use in the infinite time Turing 
machine context. On the one hand, one can use an individual real as an 
oracle just as one does in the classical context, by simply adding an or- 
acle tape containing this real, and allowing the machine to access this 
tape during the computation. This corresponds exactly to adding an extra 
input tape and thinking of the oracle real as a fixed additional input. 

But this is ultimately not the right type of oracle to consider. Rather, 
an oracle is more properly the same type of object as one that might 
be decidable or semi-decidable, namely, a set of reals, not an individual 
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real. Since such a set could be uncountable, we can’t expect to be able 
to write out the entire contents of the oracle on an extra tape. Rather, we 
provide an oracle model of relative computability by which the machine 
can make arbitrary membership queries of the oracle. Specifically, for a 
fixed oracle set of reals A, we equip an infinite time Turing machine with 
an initially blank oracle tape on which the machine can read or write. By 
attempting to switch to a special query state, the machine receives the 
answer (by moving actually to the yes or no state) as to whether the real 
currently written on the oracle tape is in A or not. In this way, the ma- 
chine is able to ask, of any real x that it is capable of producing, whether 
X G A or not. This model of oracle computation has proven robust, and 
it closely follows the well-known definition of L[yl] in set theory, the 
constructible universe relative to the predicate A, in which at any given 
stage in the construction one is allowed to apply the predicate only to 
previously constructed objects. 

From the notion of oracle computation, one can of course define a 
notion of relative computability. Specifically, the set A is computable 
from B, written A <^o B, if and only if A is supertask decidable using 
oracle B. One then also defines A =oo B if and only if A <oo B and 
B <oo A, and this is the equivalence relation of the infinite time Turing 
degrees. The strict version A <oo B holds if and only if A <oo B and 
A ^oo B. 

1 Supertask Jump Operators 

The two halting problems give rise of course to two jump operators. 
Specifically, for any set A we have the boldface and lightface jumps^: 

= { ( P, a; ) I program p halts on input x with oracle A } 

/^ = A®h^ = A®{p\ program p halts on input 0 with oracle A } 

We include the factor A explicitly in , because in general A may not 
be computable from h^. Indeed, there are some sets A that are not com- 
putable from any real at all. 

Jump Theorem 7.1 For any set, A <oo A^ <oo A^. 

^ Compare Welch’s Definition 1.9 and Definition 2.1 on pages 227 and 231 of this volume, 
respectively. 
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To prove this theorem, one first observes that A ^ <oo , sinee 
A is explieitly eomputable from A'^ and A'^ is merely the 0th sliee of 
A* . Seeondly, one knows that A <oo A'^ beeause the undeeidability of 
the relativized halting problem means that is not eomputable from 
A. The nontrivial aspeet of this theorem is the assertion that A'^ <oo A^. 
This assertion is what separates the two jump operators, and is the reason 
that we know the two halting problems h =oo 0 '^ and H CT are 
not equivalent. This follows from the more speeifie result that the set 
A^ is not eomputable from A ® z for any real 2:. In partieular, 0^ is 
not eomputable from any real. In faet, the boldfaee jump ▼ jumps mueh 
higher than the lightfaee jump V, and absorbs many iterates of the weaker 
jump, sinee A^ =00 A'^; indeed, for any ordinal a whieh is A^-writable, 
=00 (cf. [HamLewOO]). 

8 Post’s Problem for Supertasks 

Post’s problem is the question in elassieal eomputability theory of 
whether there are any non-deeidable semi-deeidable degrees strietly be- 
low the halting problem, or equivalently, whether there are any interme- 
diate semi-deeidable degrees between 0 and the Turing jump O'. This 
question has a natural supertask analogue: 

Supertask Post’s Problem 8.1 Are there any intermediate semi-decida- 
ble supertask degrees between 0 and the supertask jump O'' ? 

The answer is delieately mixed. On the one hand, in the eontext of de- 
grees in the real numbers, we have a negative answer. This eontrasts 
sharply with the elassieal theory. 

Theorem 8.2. There are no reals 2: sueh that 0 <00 <00 O''. 

Proof. Suppose that 0 <00 ^ <00 0''. So 2: is the output of program p us- 
ing O'' as an oraele. Consider the algorithm whieh eomputes approxima- 
tions to 0^, and uses program p with these approximations in an attempt 
to produee 2;. If one of the proper approximations to O'' ean sueeessfully 
produee 2:, then 2; is writable and 0 =00 Conversely, if none of the 
proper approximations ean produee ;2, then on input we ean reeognize 
O'' as the true approximation, the first approximation able to produee 2:. 
So ^ =00 0 ^. q.e.d. 
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On the other hand, when it eomes to sets of reals, we have an affir- 
mative answer. 

Theorem 8.3. There are semi-deeidable sets of reals A with 

0 <oo ^ <oo 0^. 

Indeed, there are ineomparable semi-deeidable sets A A B. 

Please eonsult [HamLewOO] for the proof. Let me mention here, though, 
that the basie idea of the argument is to generalize the Friedburg-Mueh- 
nik priority argument to the supertask eontext, mueh as Saeks’ did for 
a-reeursion theory. Building A and B in stages, we attempt to meet the 
requirements 

^ A and ^p ^ B 

by adding writable reals to A and B that have not yet appeared on the 
higher priority eomputations. One teehnieal faet to make this idea work 
is that for any eloekable ordinal a, there are many writable reals not 
appearing during the eourse of any supertask eomputation of length a. 
Thus, we ean find a supply of new writable reals to add to A and B in 
order to satisfy the later requirements, without injuring the witnessing 
eomputations of earlier higher-priority requirements. 

9 Other Models of Infinitary Computation 

Let me briefly eompare the infinite time Turing maehine model of super- 
task eomputation with some other well-known models. 

The Blum-Shub-Smale maehines (cf. [BluShuSma89]) were the orig- 
inal inspiration for infinite time Turing maehines."^ Programs and eom- 
putations for BSS maehines are finite, but the basie units of eomputation 
are full preeision real numbers. They are in essenee finite state regis- 
ter maehines, where the registers eaeh hold a real number. The primary 
purpose of introdueing the BSS maehines was to provide a theoretieal 
foundation for analyzing eomputational algorithms using the eoneepts of 
real analysis rather than arithmetie. The maehines allow one to analyze 

Jeff Kidder and I heard Lenore Blum’s lectures for the Berkeley Logic Colloquium in 1989, 
and had the idea to generalize the Turing machine concept in a different direction: to infinite 
time rather than infinite precision. 
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the dynamical features, for example, of actual algorithms in numerical 
analysis, such as Newton’s method, and illuminate questions of stabil- 
ity and convergence for such algorithms. The classical approach to these 
problems, using the Turing machine model with ever greater decimal ap- 
proximations, forces one into the realm of finite combinatorics, where 
one becomes lost in a jumble of discrete approximation error analysis, 
when one would rather fly smoothly above it in the heaven of differential 
equations. 

In another direction, the theory of higher recursion provides a model 
of infinitary computability by setting a very general theoretical context 
for recursion on infinite objects, and one should expect many parallels be- 
tween it and the theory of infinite time Turing machines. The anonymous 
referee of [HamLew02] and Philip Welch have pointed out, for example, 
that the infinitary priority argument [HamLew02, Theorem 4.1], stated as 
Theorem 8.3 above, parallels Sacks’ version of the Friedburg-Munchnik 
proof for a-recursion [Sac90], specifically when a is A, the supremum 
of the dockable ordinals. One can identify the writable reals in our ar- 
gument with the ordinal stages at which they appear and get Sacks’ sets, 
and conversely. Sacks’ could have written out codes for those stages and 
gotten our sets. This identification reveals that the <oo-degree structure 
of sets of writable reals below 0^ is exactly that of the A-degrees. Ac- 
cordingly, one can obtain not only the answer to Post’s problem, but all 
the theorems from A-recursion theory for this class of degrees, such as 
the Shore Density Theorem, etc., for free. It will be very interesting to 
see if these ideas will allow one to prove the theorems in the general case 
of all degrees. 

Lastly, let me mention quantum Turing machines, if only because I 
am often asked about them in connection with infinite time Turing ma- 
chines. Quantum Turing machines are like classical Turing machines, 
except that the configuration of the machine at any given stage is a su- 
perposition of classical configurations; the different components of these 
superpositions, like the wave functions of quantum mechanics, may con- 
structively or destructively interfere with one another as the computation 
proceeds. By means of clever quantum algorithms, one can effectively 
carry out parallel computation in these different components, construc- 
tively interfering their output to assemble the information into a final 
answer. In this way, quantum Turing machines allow for an exponential 
increase in the speed of computation of many important functions. But 
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because quantum Turing machines, at the end of the day, are simulable 
by classical Turing machines, they do not introduce new decidable sets or 
new computable functions. And so while quantum Turing machines are 
without a doubt extremely important in matters of computational feasi- 
bility, they do not really provide a model of infinitary computability. Infi- 
nite time Turing machines are simply much more powerful than quantum 
Turing machines. 

10 Questions for the Future 

I close this article by asking the open-ended question: 

Question 2. What is the structure of infinite time Turing degrees? To 
what extent do its properties mirror or differ from the classical structure? 

This question really stands for the dozens of specific open questions that 
one might ask: does the Sacks Density Theorem, for example, hold in the 
supertask context for arbitrary sets of reals? The field is wide open. 
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1 Introduction 

Under the axioms of Zermelo-Fraenkel set theory (ZF) the universe V 
of all sets ean be layered into a hierarehy of initial segments whieh is 
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indexed by the ordinal numbers. John von Neumann made the following 
reeursive definition: Vq ; = 0, V«+i := p(Va), and Va ;= Ua<A^«’ 
for limit ordinals A. The axiom sehema of foundation (among others) 
implies that this hierarehy exhausts the set theoretieal universe: V : = 
UaeOrd^o- ^ X is of Ordinal rank a if x e Vq,+i\Vq. The rank 
funetion measures the eomplexity of x in terms of the membership rela- 
tion. 

Transfinite induetion along the ordinals is essentially the only method 
to earry out involved arguments and eonstruetions on infinite sets. Via the 
von Neumann hierarehy, some induetive arguments ean be applied to all 
sets in the universe V. 

In his fundamental work on the relative eonsisteney of the axiom of 
ehoiee and of the generalised eontinuum hypothesis, Kurt Godel defined 
the constructible universe L whieh is the smallest inner model of ZF. The 
model L has beeome the prototype for a eanonieal model of set theory. 
The eonstruetible universe is defined and analysed through definability 
hierarehies. 

The present paper illustrates that there exists a variety of appropri- 
ate hierarehies for L. We briefly reeall the two elassieal examples: the 
Lo,-hierarehy by Godel and the J„-hierarehy by Jensen. We shall then 
introduee a new hierarehy (Fq) for L and relate it to Jensen’s hierarehy. 
The main teehnieal result will be that the Jensen-hierarehy eonsists of 
the limit levels of the -hierarehy whieh underlines again the eanonieal 
nature of the Jensen-hierarehy. 

Godel in [Gdd39] defined the hierarehy (Lo,)ogOrd by iterating a de- 
finable powerset operation instead of the unrestrieted powerset operation. 
Denoting the eolleetion of all first-order definable subsets of {X, g) by 
Def(X), the hierarehy is defined by: Lq := 0, L^+i ;= Def(Lo,), and 
La := Uo<A limit ordinals A. The sets formed in the various lev- 

els of this hierarehy are ealled the constructible sets, L := IJaeOrd is 
defined to be the constructible universe. 

The elements in La+i = Def(Lo,) ean be elassified aeeording to their 
defining formulas over (Lq, g) and to the parameters used. The formulas 
ean be indexed by natural numbers, and the parameters have already been 
loeated in some earlier level. This allows to denote every set in L by a 
finite sets of ordinals or, indeed, by one ordinal so that the methods of 
induetion and reeursion ean be applied to all elements of L. In this way. 
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Godel proved that the class L is an inner model of set theory in which 
the axiom of choice and the generalized continuum hypothesis hold. 

The constructible universe satisfies many combinatorial properties 
and allows involved constructions that cannot be carried out under the 
ZFC axioms alone. Ronald Jensen analysed the process of set formation 
in the constructible universe in his paper The fine structure of the con- 
structible hierarchy [Jen72] for which he received the 2003 AMS Steele 
Prize for a Seminal Contribution to Research. The theory is based on the 
analysis of first-order formulas which define a set in Lq,_|_i\L„. Jensen’s 
approach requires a thorough syntactical study of formulas of arbitrary 
quantifier complexity. 

For his work, Jensen introduced the J-hierarchy for L, whose levels 
are closed with respect to rudimentary functions. These are the functions 
generated by the scheme: 

- constant functions and projection functions are rudimentary; 

- the formation of unordered pairs is rudimentary; 

- if f{xo, , Xn-i) and go{y), ... , gn-i{y) are rudimentary functions 
then the composition f {go{y) ■, ■ ■ ■ , gn-i{y)) is rudimentary; 

- if is a rudimentary function then f{y,x) = 
mentary. 

The smallest, rudimentary closed set containing X is denoted by 
rud(X). Then the J-hierarchy is defined recursively as follows: Jq : = 
0, Ja+i ■= rud(Jo, U { Ja}), and := U«<a ordinals A. 

The fine structure theory augments the structures Jq, with a host of 
additional components like canonical well-orders, Skolem functions and 
truth predicates in order to reduce arbitrary definability over some Jq to 
El -definability over the augmented structure. The fundamental lemmas 
of the theory ensure that the additional components are preserved under 
various operations in the hierarchy. We state some properties of the J- 
hierarchy. Proofs can be found in [Dev84, Chap. VI]. 

Lemma 1.1. There exists a Si(Jq,) map from ua onto ua x ua. 

Lemma 1.2. There exists a Si(Jq.) map from ua onto J«. 

Theorem 1.3. For every a > 1, p{Ja)fiJa+i = S^(Jq). So in particular 
Soj(Ja) C Ja +1 for every a > 1. 
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Recently, some variants of fine structure theory have been proposed 
which try to keep the “model theoretic” intuitions of Jensen’s theory 
whilst reducing the syntactical complexities. S. D. Friedman and the first 
author [FriiKoe97], e.g., modified the standard La-hierarchy in a way 
which allows to use the combinatorics of Silver machines [Sil7?] in com- 
binatorial proofs whilst retaining Godel’s iterated definability approach. 

In this note, we present yet another fine structural hierarchy 

(Fa)o£Ord- 

The central idea is to incorporate canonical well-orders and Skolem func- 
tions directly into the hierarchy, so that these do not have to be defined 
over the structures. The F-hierarchy is defined via quantifier-free defin- 
ability, which simplifies fine structural arguments. It has been used by 
the first author to provide a simple proof of the Jensen covering the- 
orem. Details and proofs will be published in [Koecx)]. In the present 
article we prove that the F-hierarchy is a refinement of the J-hierarchy; 
the J-hierarchy consists of the limit structures of the F-hierarchy, hence 
the F-hierarchy is an adequate hierarchy for the constructible universe. 
The proof uses the fact that sufficiently much of the recursion for the 
F-hierarchy can be carried out inside the structures J^. 

The original motivation for introducing the F-hierarchy was core 
model theory. The F-hierarchy can be used nicely for structuring the 
Dodd-Jensen core model K, but there are problems with higher core 
models so far. 

2 The F-hierarchy 

We introduce the F-hierarchy and state some basic properties. We ap- 
proximate the constructible universe by a hierarchy (Fo)o,gOrd of struc- 
tures Fq, = (Fq, Ja, <o, S'a, g). Each Fa will be a transitive set and 
UaeOrd^o = L. The functions la and Sa are the restrictions to F^^^ 
of a global interpretation function I and a global Skolem function S de- 
fined on The relation <a is the restriction of a ternary guarded con- 
structible well-order < to F^. Let us first define a “Skolem” language S 
adequate for the structures Fq,: 

Definition 2.1. Let S be the first-order language with the following com- 
ponents: 
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- variable symbols forn < a;; 

- logical symbols = (equality), A (conjunction), A (negation), 3 (exis- 
tential quantification), (, ) (brackets); 

- function symbols / (interpretation), S (Skolem function) of variable 
finite arity; 

- a binary relation symbol G (set-membership) and a ternary relation 
symbol -< (guarded well-order). 

The syntax and semantics of S are defined as usual. The variable 
arity of / and S is handled by bracketing: if n < cu and to, , t„,-i are 
5-terms then /(to, • • • , tn-i) and S{to, . . . , tn-i) are 5-terms. If to, ti, 
t 2 are 5-terms then to&i and to^nh are atomic 5-formulas. We also 
denote the set of first-order 5-formulas by 5. 

We assume that 5 is Godelized in an effective way: the set of for- 
mulas satisfies 5 C and the usual syntactical operations of 5 are 
recursively definable over V^. This includes the simultaneous substitu- 
tion 9 ? I; of terms t for variables w in (p. The notation (p{vo, ... , Vn-i) 
indicates that the free variables of p> are contained in {/q, • • • , lin-i}- By 
So we denote the collection of quantifier-free formulas of 5. The for- 
mulas of 5 are interpreted in 5-structures in the obvious way. If ^ = 
is an 5-structure, (p{vo, • • • , Vn-i) A 5 and oq, . . . , 
ttn-i G A then A \= (p[ao, ... , Un-i] says that ^ is a model of (p under 
the variable assignment Vi i— > a* for i < n. 

The F^-hierarchy is defined by iterated 5q -definability: 

Definition 2.2. The fine hierarchy consists of a monotone sequence of 
5-structures 

F, = (F,, I, S, G) = (F„, /(F„, ^ (F„, 5tF„, g) 

where Fq,, /(Fq,, ^ (Fq,, 5|'Fq, are defined by recursion on a G Ord: For 
a < cu let Fq, = Vq, and VT G F„ : I{x) = S{x) = 0. Let be 
a recursive binary relation which well-orders F^ = in ordertype u 
and which extends the G -relation on {V^ \ n < u}. Define the ternary 
relation -< on Fq, by: x -<z y iff {z = F„ for some n < a, x,y e z and 
X <u> y)- This defines the structures Fq, for a < u. 

Assume that a > u and that F^ = (F^, 5, g) has been defined. 

For (p{vo, . . . ,Vn) G 5o and p G Fq, set 

/(Fq, 99,^ = {x G Fq I Fq ^ V^[a;,p]}. (*) 
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In this case, (Fq, 99, p) is a name for its interpretation /(Fq, in the 
fine hierarchy. The next fine level is defined as 

Fa+i = {/(F„,p,^ I if e SoJ e F<‘^}. 

We have to define I (;?) for “new” vectors Certain 

assignments were made in (*); in all other cases set I{z) = 0. 

Define the ternary relation ^ |'Fq,+i extending ^ |'Fq, by adjoining the 
following triples: x y iff x, p e F„ and there is a name (F^, (p,p) 
for X such that every name (F.y, q) for y is lexicographically greater 
than (F/3, p, p), where coordinates are well-ordered from left to right by 
e, and respectively. 

The Skolem function S finds witnesses of existential statements. We 
only need to define S{z) for z G (Fq,+i)<‘^\F<‘^. Set S{z) = 0 except 
when z = (F„,(p,p), where (p e Sq, p e F<“^ and /(F^, p,p) ^ 0 . In 
that case let S{z) be the -<F„-least element of /(Fq,, <p,p). 

This defines F„+i = (Fq,+i, I, -<, S, e). 

Assume that A > cc is a limit ordinal and that Fq, is defined for a < 
A. Then the limit structure Fa = (Fa, I, S, g) is defined by unions: 

Fa = Ua<AF., /(Fa = Ua<A^(FQ, ^(Fa = Ua<A^rFQ, ^(Fa = 

U„<A^fFQ. 

The fine hierarchy satisfies some natural hierarchical properties some 
of which were already tacitly assumed in the previous definition: 

Proposition 2.3. For every 7 G Ord: 

1. a < 7 — ^ Fq C F.^; 

2 . a < 7 ^ Fq G F^; 

3 . F.y is transitive. 

First-order definability can be emulated in Sq: 

Proposition 2.4. For every G -formula • • • , fin-i) one can uniform- 
ly define a quantifier free formula • • • , f'n-i, • • • , f'n+fc) ^ 'Sq 

such that for all a > cc and for all Oq, . . . , G Fq: 

(Fq, G) 1 = <p[«0) • • • ) O-n-l] 

Fq-I-^ I <p [do :■■■■> 1 , Fq , Fq_|_i , . . . , Fq^/j] . 
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Proposition 2.5. Every is closed with respect to first order defin- 
ability. 

Theorem 2.6. Uaeord = L 

So we have defined a hierarchy for the constructible universe. Proofs 
of the above theorem and propositions can be found in [Koecx)]. 

We show that the FQ-hierarchy is a refinement of the Ja-hierarchy: 

Theorem 2.7. For every a E Ord, F^q = Jq. 

The proof of this result occupies the rest of this paper and consists of 
a sequence of lemmas. Using Proposition 2.5 one can easily prove one 
inclusion. 

Lemma 2.8. For every a E Ord, Jq C F^^- 

Proof. By induction on a. For a = 0 and a = 1 this is true by definition, 
so assume 3^ C F^..^ for every 7 < a. Jq, is the rudimentary closure 
of the J-y’s. As rudimentary functions are first-order definable, F^^ is 
rudimentarily closed by Proposition 2.5. Hence F^q, includes Jq,. q.e.d. 

For the other inclusion, we show that the F-hierarchy is absolute 
for each where a > 1, i.e., the above recursive definition of the F- 
hierarchy can be carried out within Jq, and will define the same sequence 
as the recursive definition in V for all ordinals in Jq. Since Jq is rudi- 
mentary closed and contains the language S as an element, we see by in- 
spection that the recursive conditions in the definition of the F-hierarchy 
are absolute for Jq. Properties like 

= {x G F^ I F^ h 

or 

F7+1 = {^(F7,<^,^ I if E 5o,p G F^"^} 

refer to a quantifier-free satisfaction relation. Evaluating this within Jq 
involves evaluating finite sequences of values of terms and truth-values 
according to the structure of the formula 99 considered. These finite se- 
quences exist in the structure Jq just like they exist in the universe V. 
The other recursive conditions in Definition 2 are absolute for similar 
reasons. So the recursion can indeed be carried out absolutely in Jq pro- 
vided we can prove the following 
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Claim 5 . For all ordinals a and all /r < a; a, we have {F^\v < jj) e J„. 

This can be shown by induction on a. The limit case is trivial: if a 
is a limit ordinal the inductive hypothesis implies that for all /9 < a, all 
initial segments of the F-hierarchy of length < u(3 are in Jfs. This yields 
the property for Jq. 

Now consider the successor case: assume that a = /3 + 1 and that 
V/i < u/3 (F,^ I V < /i) G J/3- Then (F,^ | u < u/3) is definable over 
by the recursion formula and is thus an element of Jq. To show that the 
sequences (F,^ | v < /i) for u/3 < /i < ua are elements of Jq, it suffices 
to show by induction on u, uf3 < v < ua that F,^ G J«. 

Since the claim holds for /9, (F,^ | z/ < ujfi) is recursively definable 
over J^. Then the structure F^^ is definable over as the union of that 
tower of structures. Thus F^/j G J/?. 

For the successor step of this inner induction, we use a surjection 
/i G J /3+1 from oj(3 onto F^^+„ to define an F^/j+„-like structure over 
cu/9, hence over J^. We can then define the next level of the F-hierarchy 
within J/3+1. We start with some technical lemmas. 

Lemma 2 . 9 . Let x G ip+i and let h G J/3+1 be a surjection from cu/9 
onto X . Then x^"^ G J/3+1. 

Proof. The set = {s | Func(s) A dom(s) G cu A ran(s) C 

10(3} is a definable subset of J/3. Hence (cu/d)^'^ G J/3+1. Since J/3+1 is 
rudimentarily closed: 

x^‘^ = {ho s \ s e 
= h o s 

^ J/3+1- 



q.e.d. 

For a surjection h from uf3 onto x we denote by h the surjection from 
(cu/3)<‘^ onto x^"^ induced by h: for any s = , ^n-i) set 



h{s) :=hos = (/i(Co), • • • , h{in-i))- 



Then h G J/3+1 implies that h G J/3+1. 
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Lemma 2.10. For every a > 1, 

Proof. In [Dev84, VI.1.14], a language similar to S is defined over Ji. 
Therefore, Sq is definable over Ji and an element of for a > 1. q.e.d. 

The following lemma then concludes the proof of our theorem: 



Lemma 2.11. Let z/ = uf3 + n, = (F,^, J, S', e) G J/j+i, hy a 
surjection fromcu/S onto F,^ and G J/j+i- Then F,^+i G J^^+i and there 
is a surjection h^+i from cu/S onto F,^+i such that h^+i G J/j+i. 



Proof. Define a structure F,^ = -<j^, Sy, Ey) over oj(3 which is 

analogous to F,^: 



Fy {7 G uf 3 I hy{;^) G F,^} 
iy{(p, s) := {7 G 07/5 I hy{'y) G Iy{(p, /Ji.(s))} 
:= I {hy{l),hy{5)) G <y} 

:= I hy{,l) G hy{5)} 



These are all subsets of 3 p and elements of J/ 3 + 1 , hence definable over 
3p. We complete the structure by defining: 



<p€So 

se(a;/J)<“ 

sG(ujp)<‘^ 

fl/ • i,fuj '^U1 G;/)- 



The latter definitions are compositions of rudimentary functions and ele- 
ments of J/ 3 + 1 , hence fy G J/j+i. 
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We can now extend to a structure = {fu+i, V+i, ~<u+i) 
in a similar way as in the F-hierarchy: 

V+i(<^, s) := {7 e 0^/3 I h <^[7, 

Wi := U 

ipS>So 

seM)<“ 

fv+i-= IJ {wi(<^,s)} 

-<!/+l '- = -<v U{(7, ( 5 ) I 7 G /;y A 5 G /^+i\/!/} 

u {(7, <^) I s Vi/>, f(7 = s) 

A (5 = A (99 G i/’ V (99 = !/> A s <iex i))) 

Siy(s) if s G dom(s,y), 

the -<,^-least of v+i(v^, i) if s ^ dom(s,^) and 

s is of the shape 

0 otherwise. 

Sy+l ■ {(' 5 ) 

We look closer at the definitions to show that the structure f^+i is an ele- 
ment of J/3+1. First we remark that similarly to ordinary So-satisfaction, 
the f-satisfaction relation for quantifier-free formulae is rudimentary. For 
ordinary So-satisfaction this carried out in detail in [Dev 84 ]. Therefore 

iu+i G J/3+1 and /,^+i G J/3+1. 

To see that J/3+1 we note first that the first part of the defini- 

tion is an element of J/3+1 by assumption and the second as well as we 
know G J/3+1. For the third part we note that 

^ ^lex t \ y 

s ^ t A (dom(s) < dom(t) V (dom(.^ = dom(t)A 
((Vn G dom(^) s{n) > t{n) ( 3 m < n) s{m) -<p t{m)))) 

Finally the Skolem function is readily definable from -<^ and 
i„+i, hence is an element of J/3+1. So f^+i := {fu+i, iu+i, -<,.+1, G 

J/ 3 + 1 - 
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We now define 

:= IJ K{-f) 

TS/iz+l 

:= {^^^( 7 ) I 7 e iu+i) 

Iu+i-.= IJ 

<(3e5o 

se(o;/3)<“ 

<^+i := I (7,5) e^^+i} 

5’jy-i-i .= {(/i;y(^, /ji/(7)) I ('5)7) ^ 



By a now familiar argument we find Fj^+i G J/ 3+1 

We still need to eonstruet a surjeetion from cu/9 onto Fj^+i. We 
ean define a surjeetion g from cu/9 onto x (see for example 

[Dod82]). Let g' be its restrietion to 5 q x Then we define h^+i : = 

hy o o g'. It is easily seen this is a surjeetion and by eonstruetion we 
have g' G J/3+1, henee h^+i G J/3+1. q.e.d. 

In all, we have proved Theorem 2.7. q.e.d. 
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Abstract. Lutz in [LutOOa] has recently proved a new characterization of Haus- 
dorff dimension in terms of gales, which are betting strategies that generalize 
martingales. We present here this characterization and give three instances of how 
it can be used to define effective versions of Hausdorff dimension in the contexts 
of constructible, finite-state, and resource-bounded computation. 



1 Introduction 

Resource-bounded measure, a generalization of classical Lebesgue mea- 
sure, was developed by Lutz in 1991 [Lut92] in order to investigate the 
internal structures of complexity classes. This line of research has proven 
to be very fruitful [AmboMayi97], [Lut97], [LutMayOl], but there are 
certain inherent limitations to the information that resource-bounded 
measure can provide. These limitations, also present in classical Lebes- 
gue measure, come from the fact that measure cannot make quantitative 
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distinctions inside a measure zero set, and also from the Kolmogorov 
zero-one law ([DaiOl], [Lut98]) that implies that for most sets of interest 
in computational complexity and recursion theory the measure is zero, 
one or undefined. 

Hausdorff dimension was defined as an augmentation of Lebesgue 
measure theory [Hau 19]. Every subset of a given metric space is assigned 
a dimension. We are interested in the metric space being the Cantor space 
C consisting of all infinite binary strings. In this case the dimension of 
each set X C C is a real number dimH(X) G [0, 1]. Hausdorff dimension 
is monotone, dimH(0) = 0, dimH(C) = 1, and intermediate values oc- 
cur for many interesting sets. Also, if dimH(X) < 1 then X is a measure 
zero subset of C, so Hausdorff dimension can quantitatively distinguish 
among measure zero sets. 

Hausdorff dimension was originally defined topologically {cf. 
[Hau 19], [Fal85]). Lutz in [LutOOa] recently proved a new characteri- 
zation of Hausdorff dimension in terms of gales or supergales, which 
are betting strategies that generalize martingales. The class of gales that 
“succeed” on the sequences in a set X C C determines the dimension 
of X. Therefore a natural generalization of Hausdorff dimension arises 
by restricting the class of admissible gales or supergales. This is useful 
when we want a nontrivial dimension inside a countable set such as the 
set of all decidable sequences. 

In this paper we present three effectivizations of Hausdorff dimen- 
sion. The first one is constructive dimension {cf. [LutOOb], [Lut02]), de- 
fined by restricting to constructive (lower semicomputable) supergales, 
the second is finite-state dimension ([Dai-i-01]), defined by the natural 
finite-state effectivization, and the third one is resource-bounded dimen- 
sion, in which a particular complexity time-bound is enforced on gales. 

Hausdorff dimension has also been related to measures of informa- 
tion content. Ryabko {cf. [Rya86], [Rya93], [Rya94]), Staiger {cf 
[Stao93], [Stao98]) and Cai and Hartmanis {cf. [CaiHar394]) have all 
proven results that relate Hausdorff dimension with Kolmogorov com- 
plexity. This line of thought is also present in effective dimension. Con- 
structive dimension can be fully characterized in terms of Kolmogorov 
complexity {cf. [May02]) and finite state dimension is definable in terms 
of sequence compressibility {cf. [Dai-i-01]). We present here these two re- 
sults supporting the intuition that dimension is a measure of information 
density. 
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In Section 2 we introduce a few preliminary definitions. Section 3 
contains Lutz’s characterization of Hausdorff dimension, with the key 
concepts of gale and supergale. Section 4 presents constructive dimen- 
sion of sets and sequences. Section 5 contains finite-state dimension, and 
Section 6 is a brief sketch of resource-bounded dimension. 

2 Preliminaries 

We work in the set {0, 1}* of all (finite, binary) strings and in the Cantor 
space C of all (infinite, binary) sequences. We write |te| for the length 
of a string w G {0, 1}*. The empty string is denoted A. For S' G C 
and f, j G N, f > j, we write S[i..j] for the string consisting of the fth 
through jth bits of S, stipulating that S[0..0] is the leftmost bit of S. For 
each w G {0, 1}*, te is a prefix of S if te = S [0..\w\ — 1] or te = A. 

We will freely identify each language A C {0, 1}* with its character- 
istic sequence xa ^ C. 

We will refer to the low levels of the arithmetical hierarchy of sets of 
sequences, specifically to sets in II^, S 2 , and II 2 . The definition can be 
found in [Rogo67]. 

We will also mention sequences in S?, that is, r.e. sequences, se- 
quences in 115 , that is, co-r.e. sequences, and sequences in A°, that is, 
sequences that are decidable relative to the halting oracle. 

A function / : {0, 1}* — M is lower semicomputable if its lower 
graph Graph“(/) = |(a;,s) G {0,1}* x [0, cx)) s < /(a;)| is recur- 
sively enumerable. 

A real number a is computable if there is a computable function / ; 
N — Q such that for all r G N, |/(r) — a\ < 2“’’. A real number a 
is A 2 -computable if there is a function / : N — Q that is computable 
relative to the halting oracle such that for all r G N, |/(r) — a\ < 2“’’. 

Let f : N — > N. A function / : (0, 1}* — M is computable in time 
(space) t if there is a function / ; {0,1}* x N — that can be computed 
by a Turing Machine in time (space) t and such that for all x G {0,1}% 
r G N, |/(a;,r) - /(a;)| < 2 L 

The Kolmogorov complexity K(a;) of a finite binary sequence x is the 
length of the shortest description of x (the full definition and properties 
can be found in the book by Li and Vitanyi [LiVit97]). 
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We use the LOGSPACE-uniform version of the bounded-depth eir- 
euit eomplexity elass ACq ([Joh90]). 

For eaeh / : N — N, SIZE(/) is the set of languages A C {0, 1}* 
sueh that for all n G N there is an n-input boolean eireuit of at most f(n) 
gates that reeognizes A fi {0, 1}*^. 

We define the frequency of a nonempty string w G {0,l}*tobe 
the ratio freq(te) = , where ^{b, w) denotes the number of oeeur- 

renees of the bit b in w. For eaeh a G [0, 1], we define the set 



FREQ(a) 



I^G C 



lim freq(S' [0..n — 1]) = a 



The binary Shannon entropy funetion Ti ; [0, 1] — [0, 1] is defined as 



n{x) 



X log — h (1 

X 



x) log 

1 — a; 



with ?-f(0) = 7-f(l) = 0 to ensure eontinuity at 0 and 1. 



3 Lutz’ Characterization of Hausdorff Dimension 

This seetion reviews the gale eharaeterization of elassieal Hausdorff di- 
mension, whieh motivates our development. 

Definition 3.1. Let s G [0, cx)) 

1. An s-supergale is a funetion d : {0, 1}* — [0, cx)) that satisfies the 
eondition 

d{w) > [d{w0) + d{wl)] (*) 

for all w G {0, 1}*. 

2. An s-gale is an s-supergale that satisfies eondition (*) with equality 
for all w G {0, 1}*. 

3. A supermartingale is a 1 -supergale. 

4. A martingale is a 1-gale. 

Intuitively, an s-supergale is a strategy for betting on the sueeessive 
bits of a sequenee S' G C. For eaeh prefix w of S', d{w) is the eapital 
(amount of money) that d has after betting on the bits w of S'. When 
betting on the next bit & of a prefix wb of S' (assuming that b is equally 
likely to be 0 or 1), eondition (*) tells us that the expeeted value of d{wb) 




EFFECTIVE HAUSDORFF DIMENSION 



175 



-the capital that d expects to have after this bet- is (<i(teO) + d{wl))/2 < 
2^~^d{w). If s = 1, this expected value is at most d{w) -the capital 
that d has before the bet- so the payoffs are at most “fair.” If s < 1, 
this expected value is less than d{w), so the payoffs are “less than fair.” 
Similarly, of s > 1, the payoffs can be “more than fair.” 

Note: We will use in Section 4 the concept of /9-martingale, where (3 is a 
real number in [0, 1]. In this case when betting on the next bit 6 of a prefix 
wb or S we assume that (3 is the probability of h being 1 . Specifically, a 13- 
martingale is a function d : {0, 1}* — [0, cx)) that satisfies the condition 

d{w) = {1 — (3) d{w0) 3- (3 d{wl) 

for all te G {0, 1}*. Note that for /9 = 1/2 we obtain the above definition. 
Of course the objective of an s-supergale is to win a lot of money. 

Definition 3.2. Let d be an s-supergale, where s G [0, cx)). 

1. We say that d succeeds on a sequence S' G C if 

limsup(i(S' [0..n — 1]) = cx. 

n— >oo 

2. The success set of d is 

S°°[d] = js* G C d succeeds on S'! . 

3. For X C C,Q{X) is the set of all s G [0, x) such that there is an 
s-gale d for which X C S'°°[<i]. 

4. For X C C,Q{X) is the set of all s G [0, x) such that there is an 
s-supergale d for which X C S'“[ci]. 

Note that if s, s' G [0, x) then for every s-supergale d, the func- 
tion d' : {0,1}* — [0, x) defined by d'{w) = is an s'- 

supergale. 

It was shown in [LutOOa] that the following definition is equivalent to 
the classical definition of Hausdorff dimension in C. 

Definition 3.3. The Hausdorff dimension of a set X C C is 



dimH(X) = inf G{X) = inf G{X). 
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See [Fal85] for a good overview of classical Hausdorff dimension, in- 
cluding the original topological definition based on open covers by balls 
of diminishing radii {cf. [Haul9]). 

The gale characterization of Hausdorff dimension that we have pre- 
sented can be generalized by restricting the class of gales or supergales 
that are allowed. We will follow this idea in the rest of the paper. 

Eggleston in [Egg49] proved the following classical result on the 
Hausdorff dimension of a set of sequences with a fixed asymptotic fre- 
quency. 

Theorem 3.4. Eor each real number a G [0,1], 

dimH(FREQ(a)) = U{a). 

We will reformulate this last result in the contexts of the dimensions 
defined in Sections 4, 5, and 6. 

4 Constructive Dimension 

In this section we present the first effective version of of Hausdorff di- 
mension that is defined by restricting the class of supergales to those that 
are lower semicomputable. We first give the definitions of constructive 
dimension of a set and constructive dimension of a sequence, and then 
we relate them and give their main properties. We finish the section sum- 
marising the main results on constructive dimension. The results in this 
section are mainly from [EutOOb] and [Eut02], also including those in 
[May02] and [Hit02]. 

An s-supergale is constructive if it is lower semicomputable. We de- 
fine constructive dimension as follows. 

Definition 4.1. Eet X C C. 

1- ^constr(-^) is thc sct of all s G [0, cx)) such that there is a constructive 
s-supergale d for which X C 

2. The constructive dimension of a set X C C is 

cdim(X) = inf ^constr(X). 

3. The constructive dimension of an individual sequence R G C is 
dim(5') = cdimdR}). 




EFFECTIVE HAUSDORFF DIMENSION 



177 



By Lutz characterization of Hausdorff dimension (Definition 3.3), we 
conclude that cdim(X) > dimH(X) for all X C C. But in fact much 
more is true for certain classes, as Hitchcock shows in [Hit02]. For sets 
that are low in the arithmetical hierarchy, constructive dimension and 
Hausdorff dimension coincide. 

Theorem 4.2 (Hitchcock). If X C C is a union of 11° sets, then 

dimH(X) = cdim(X). 

We note that Theorem 4.7 below yields a new proof of Theorem 4.2 
from Theorem 5 of Staiger in [Stao98]. 

Hitchcock also proves that this is an optimal result for the arithmetical 
hierarchy, since it cannot be extended to sets in II^. 

For Hausdorff dimension, all singletons have dimension 0, and in fact 
all countable sets have Hausdorff dimension 0. The situation changes dra- 
matically when we restrict to constructive supergales, since a singleton 
can have positive constructive dimension, and in fact can have any con- 
structive dimension. 

Theorem 4.3. For every a G [0, 1], there is an S' G C such that dim(S') 
= a. [Lut02] 

We note that Theorem 4.7 below yields a new proof of Theorem 4.3 
from Lemma 3.4 of Cai Hartmanis [CaiHar394]. 

The constructive dimension of any set X C C is completely deter- 
mined by the dimension of the individual sequences in the set. 

Theorem 4.4 (Lutz). For all X C C, cdim(X) = sup^.g^j- dim(a;).^ 

There is no analogue of this last theorem for Hausdorff dimension or 
for any of the concepts we will define in Sections 5 and 6. The key ingre- 
dient in the proof of Theorem 4.4 is the existence of optimal constructive 
supergales, that is, constructive supergales that multiplicatively dominate 
any other constructive supergale. This is analogous to the existence of 
universal tests of randomness in the theory of random sequences. 

Theorem 4.2 together with Theorem 4.4 implies that the classical 
Hausdorff dimension of every E° set X C C has the pointwise char- 
acterization dimH(X) = sup^.gj,^' dim(a;). 

‘ Cf. [Eut02], 
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Theorem 4.4 immediately implies that eonstruetive dimension has the 
countable stability property, whieh also holds for elassieal Hausdorff di- 
mension. The proof ean be found in [Lut02]. 

Corollary 4.5. For all Xq, Xi, X 2 , . . . C C, 



Xfc = supcdim(Xfc). 

I fceN 

Let a G [0, 1]. If we define DIM<a = js* G C dim(5') < a|, then 
this is the largest set of eonstruetive dimension a. 

Theorem 4.6. For every a G [0,1], the set DIM<„ has the following two 
properties. 

1. cdim(DIM<Q,) = a. 

2. For all X C C, if cdim(X) < a, then X C DIM<q. 

The proof ean be found in [Lut02]. We note that Theorem 4.7 below 
yields a new proof of Theorem 4.6 (part 1) from Theorem 2 of Ryabko 
[Rya84]. 

The eonstruetive dimension of a sequenee ean be eharaeterized in 
terms of the Kolmogorov eomplexities of its prefixes. 

Theorem 4.7. For all A G C, 




dim (A) = liminf 

n— >00 



K{A [0..n- 1]) 
n 



This latest theorem is proven in [May02] and justifies the intuition 
that the eonstruetive dimension of a sequence is a measure of its algo- 
rithmic information density. The relation of dimension and information 
content will appear again in a different context in the next section, where 
the finite-state dimension of a sequence (to be defined) is characterized 
in terms of its compressibility. 

We now briefly state the main results proven so far on constructive 
dimension, including the existence of sequences of any dimen- 
sion, the constructive version of Eggleston theorem, and the constructive 
dimension of sequences that are random relative to a non-uniform distri- 
bution. 

An important result in the theory of random sequences is the exis- 
tence of random sequences in A°. We have the following analogue for 
constructive dimension. 
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Theorem 4.8. For every A^-eomputable real number a G [0, 1] there is 
a A° sequenee S sueh that dim(5') = a. [Lut02] 

And this eannot be improved to or Il'j’ sequenees sinee they all 
have eonstruetive dimension 0. 

This is the eonstruetive version of the elassieal Theorem 3.4 of Eggle- 
ston in [Egg49] and ean be found in [Eut02] . 

Theorem 4.9. If a is A^-eomputable real number in [0, 1] then 

cdim(FREQ(o)) = U{(x). 

An anonymous referee has pointed out that an alternative proof of 
Theorem 4.9 ean be derived from the newer Theorem 4.7 and earlier 
results of Eggleston (c/ [Egg49]) and Kolmogorov (cf. [ZvoEevVO]). In 
faet, this approaeh shows that Theorem 4.9 holds for arbitrary a G [0,1]- 

A sequenee is (Martin-Lof) random [Marsbb] if it passes every algo- 
rithmieally implementable test of randomness. This ean be reformulated 
in terms of martingales as follows 

Definition 4.10. A sequenee A G C is (Martin-Lof) random if there is 
no eonstruetive supermartingale d sueh that A G [SeheVl] 

By this definition, random sequenees have eonstruetive dimension 1. 
Eor nonuniform distributions we have the eoneept of /^-randomness, for 
(3 any real number in (0, 1) representing the bias. 

Definition 4.11. Eet (3 G (0,1). A sequenee A G C is (Martin-Lof) 
random relative to (3 if there is no eonstruetive /5-martingale d sueh that 
A G S^[d]. [SehgVl] 

Eutz relates randomness relative to a non-uniform distribution with 
Shannon information theory. 

Theorem 4.12. Eet f3 G (0, 1) be a eomputable real number. Eet A G C 
be random relative to f3. Then dim(A) = H{(3). 

A more general result for randomness relative to sequenees of eoin- 
tosses is obtained in [Eut02]. 

Very reeently, Eutz in [Eut02] has introdueed the eoneept of dimen- 
sion of finite binary strings, with very interesting eonneetions with eon- 
struetive dimension. We eannot eover this topie here due to laek of spaee. 
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5 Finite-State Dimension 

Our second effectivization of Hausdorff dimension will be the most re- 
strictive of those presented here, we will go all the way to the level of 
finite-state computation. In this section we use gales computed by multi- 
account finite-state gamblers to develop the finite-state dimensions of 
sets of binary sequences and individual binary sequences. The theorem 
of Eggleston is shown to hold for finite- state dimension. Every rational 
number in [0, 1] is the finite-state dimension of a sequence in the low- 
level complexity class ACq. The main theorem of this section shows that 
the finite-state dimension of a sequence is precisely the infimum of all 
compression ratios achievable on the sequence by information-lossless 
finite- state compressors. 

All the results in this section are from [Dai-i-01]. 

We start by introducing the concept of finite-state gambler that is 
used to develop finite-state dimension. Intuitively, a finite-state gambler 
is a finite-state device that places k separate binary bets on each of the 
successive bits of its input sequence. These bets correspond to k separate 
accounts into which the gambler’s capital is divided. Bets are required to 
be rational numbers in B = Q n [0, 1]. 

Definition 5.1. If k is a positive integer, then a k-account finite-state 
gambler (k-account FSG) is a 5-tuple 

G = (Q, S, j3, go, Co), 



where 

- Q is a nonempty finite set of states, 

- <5:(5x{0,l}— s>Qis the transition function, 

- /9 : Q — ^ is the betting function, 

- go G Q is the initial state, and 

- Co = (co,i, . . . , co,fc), the initial capital vector, is a sequence of non- 
negative rational numbers. 

A finite state gambler (FSG) is a /c-account ESG for some positive integer 
k. 



The case k = 1, where there is only one account, is the model of 
finite-state gambling that has been considered (in essentially equivalent 
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form) by Schnorr and Stimm in [Sch6Sti72], Feder in [Fed91], and oth- 
ers. 

Intuitively, if a /c-aeeount FSG G = {Q, 6, (3, go, co) is in state q ^ Q 
and its eurrent eapital veetor is c = (ci, . . . , Cfc) G (Q n [0, oo))’^, then 
for eaeh of its aeeounts i G {1, . . . , /c}, it plaees the bet /3i{q) G B. If 
the payoffs are fair, then after this bet G will be in state <5(g, b) and its ith 
aeeount will have eapital 



2ci[(l - 6)(1 - A(g)) + bf3i{q)] 



2pi{q)d ifb = l, 

2{1 - /3i{q))ci ifb = 0. 



This suggests the following definition. 

Definition 5.2. Let G = {Q, 6, (3, go, co) be a /c-aeeount finite-state gam- 
bler. 

1 . For eaeh I < i < k, the ith martingale of G is the funetion 



dG,*:{0,ir ^[0,oo) 



defined by the reeursion 



dc,iW = co,i, 

dcA^b) = 2dG,i{w)[{l - b){l - (3i{5{w))) + b(3i{5{w))] 
for all w G {0, 1}* and b G {0, 1}. 

2. The total martingale (or simply the martingale) of G is the funetion 

k 

do = dc,i- 

i=l 

3. For s G [0, cx)), the s-gale of an FSG G is the funetion 

4);{0,ir ^[0,oo) 

defined by 

for all te G {0, 1}*. In partieular, note that = do. 

4. For s G [0, CX)), a Unite-state s-gale is an s-gale d for whieh there 
exists an FSG G sueh that d^Q = d. 
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We now use finite-state gales to define finite-state dimension. 

Definition 5.3. Let X C C. 

1. ^Fs(-^) is the set of all s G [0, cx)) sueh that there is a finite-state 
s-gale d for whieh X C 

2. The finite-state dimension of set X is 

dimFs(X) = inf^Fs(-’^)- 

3. The finite-state dimension of a sequenee 5 G C is 

dimFs(5') = dimFs({5'}). 

In general, dimFs(X) is a real number satisfying 0 < dimH(X) < 
dimFs(X) < 1. Like Hausdorff dimension, finite-state dimension has the 
stability property. 

Theorem 5.4. For all X, L C C, 

dimFs(X U L) = max {dimFs(X), dimFs(L)} . 

Let us briefly digress on the role of multiple aeeounts in the FSG 
model. The proof of Theorem 5.4 is based on the faet that if d± and d2 are 
finite-state s-gales then di -f ^2 is a finite-state s-gale, that is, finite-state 
gales are elosed under nonnegative, rational, linear eombinations. But 
multiple aeeounts are required for this elosure property to hold, sinee 
there exist 1-aeeount finite-state s-gales di and d 2 sueh that di -f d 2 is not 
a 1-aeeount finite-state s-gale. 

Notwithstanding the usefulness of the above elosure property, the 
question remains whether multiple aeeounts are strietly neeessary for a 
theory of finite-state dimension. The next result shows that multiple ae- 
eounts are not strietly neeessary if we are willing to aeeept a large blowup 
in the number of states. 

Theorem 5.5. For eaeh n-state, fc-aeeount FSG G and eaeh e G (0, 1) 
there is an n ■ /c® -state, 1-aeeount FSG G' sueh that for all s G [0, 1], 



The theorem of Eggleston in [Egg49, Theorem 3.4] holds for finite- 
state dimension. 
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Theorem 5.6. For all a G Q fi [0, 1], 

dimFs(FREQ(Q;)) = 7d(o;). 

The following theorem says that every rational number r G [0, 1] is 
the finite-state dimension of a reasonably simple sequence. 

Theorem 5.7. For every r G Q fi [0, 1] there exists S G ACq such that 
dimFs(5') = r. 

The main result in this section is that we can characterize the finite- 
state dimensions of individual sequences in terms of finite-state com- 
pressibility. We first recall the definition of an information-lossless finite- 
state compressor. (This idea is due to Huffman [Huf59]. Further exposi- 
tion may be found in [Koh78] or [Kur74].) 

Definition 5.8. A finite-state compressor (FSC) is a 4-tuple 

C = {Q,S,u,qo), 

where 

- Q is a nonempty, finite set of states, 

- <5:(5x{0,l}— s>Qis the transition function, 

- z/ : Q X {0, 1} — > {0, 1}* is the output function, and 

- qo G Q is the initial state. 

For q E Q and w G {0, 1}*, we define the output from state q on input w 
to be the string u{q, w) defined by the recursion 

iy(q, A) = A 

u{q,wh) = u{q, w)h'{6{q, w) ,b) 

for all w G {0, 1}* and b G {0, 1}. We then define the output of C on 
input w G {0, 1}* to be the string 

C(w) = iy(qo,w). 

Definition 5.9. An FSC C = (Q, 5, u, go) is information-lossless (IL) if 
the function 

{o,ir ^{o,ir xQ 

w H- > (C(w), S{w)) 

is one-to-one. An information-lossless Unite-state compressor (ILFSC) is 
an FSC that is IL. 
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That is, an ILFSC is an FSC whose input can be reconstructed from 
the output and final state reached on that input. 

Intuitively, an FSC C compresses a string w if \C{w) \ is significantly 
less than Of course, if C is IL, then not all strings can be compressed. 

Our interest here is in the degree (if any) to which the prefixes of a given 
sequence S' G C can be compressed by an ILFSC. 



Definition 5.10. 1. If C is an FSC and S' G C, then the compression 
ratio of C on S' is 



gc{S) = liminf 



|C(S|0..n-l])| 



n 



2. The finite-state compression ratio of a sequence S' G C is 



^Fs(^) = inf {^c(^)|C' is an ILFSC} . 



The following theorem says that finite-state dimension and finite- 
state compressibility are one and the same for individual sequences. 

Theorem 5.11. For all S' G C, 



dimFs(S') = ^>Fs(*S'). 

Theorem 5.11 is a new instance of the relation of dimension and in- 
formation. 

Finite-state dimension is a real-time effectivization of a powerful tool 
of fractal geometry. As such it should prove to be a useful tool for im- 
proving our understanding of real-time information processing. 



6 Dimension in Complexity Classes 

We now use resource-bounded gales to develop dimension in complexity 
classes. 

Let p be the class of polynomial time computable functions from 
(0, 1}* to M. Let pspace be the class of polynomial space computable 
functions from {0, 1}* to M. Using these two classes we define p-dimen- 
sion and pspace-dimension in the natural way. 

Definition 6.1. Let A be p or pspace. Let ACC. 
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1. Q/^{X) is the set of all s G [0, cx)) such that there is an s-gale d G A 

for which X C [d] . 

2. The A-dimension of a set A C C is diniA(A) = inf Qa{X). 

Let E = DTIME(2^“®’^'') be the class of languages that can be rec- 
ognized in linear exponential time. Let ESPACE = DSPACE(2^“®'^’') 
be that class of languages that can be recognized in linear exponential 
space. 

The following result is proven in [LutOOa] and implies that p and 
pspace are the right dimension bounds for the classes E and ESPACE, 
respectively. 

Theorem 6.2. 1. dimp(E) = 1. 

2. dimpspace (ESPACE) = 1. 

Therefore we can define dimension in the classes E and ESPACE 
as follows. 

Definition 6.3. For each A C C, 

1. dim(A|E) =dimp(AnE). 

2. dim(A I ESPACE) = dimpspace(A n ESPACE). 

These definitions endow the classes E and ESPACE with internal 
dimension structure. Several interesting results have already been proven 
([LutOOa], [HitOl], [Ambi-i-01]), indicating that dimension can give a 
new insight to open questions in Computational Complexity, in particular 
since many sets of interest in Computational Complexity have “fractal- 
like” structures. 

We mention two of these results. 

Theorem 6.4. For all a G Q, 

dim(FREQ(a) | E) = dimp(FREQ(a)) = H{a). 

Theorem 6.4 is proven in [LutOOa] for a a “p-computable” real num- 
ber. Notice that we have proven Eggleston’s theorem in each of our for- 
mulations (for appropriate restrictions of a). This indicates that, no mat- 
ter how we limit our power to compute dimension, a bounded asymptotic 
frequency of the elements implies a restriction on the dimension of the 
set. 

We finish with a result on nonuniform classes that generalizes the 
known fact ([Lut92]) that SIZE(^) has measure zero in ESPACE: 
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Theorem 6.5. For every real a G [0,1], 



dim SIZE 




ESPACEJ 




FotFS 

III 



Benedikt Lowe, Boris Piwinger, Thoralf Rasch (eds.) 

Classical and New Paradigms of Computation 
and their Complexity Hierarchies 

Papers of the conference ‘‘Foundations of the Formal Sciences III” 
held in Vienna, September 21-24, 2001 



Axiomatizability of algebras of binary 
relations 

Szabolcs Mikulas* 

School of Computer Science and Information Systems 
Birkbeck College, University of London 
Malet Street, London WCIE 7HX, UK 

E-mail: szabolcs@dcs.bbk.ac.uk 



Abstract. We present an overview of the axiomatizability problem of algebras of 
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1 Introduction 

In algebraic logic, the most extensively investigated class of algebras 
of binary relations is the class RRA of representable relation algebras. 
These are Boolean algebras equipped with some extra-Boolean opera- 
tions arising from the nature of relations, like relation composition, rela- 
tion converse and the identity relation. The class RRA is a variety, i.e., it 
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is an equational class (cf. [Tar55]). When Tarski defined RRA, he raised 
the problem of whether RRA was finitely axiomatizable using equations; 
Monk answered the question negatively in [Mono64]. 

An interesting question is whether algebras of binary relations with 
smaller clones of operations can be finitely axiomatized. See [Schi91] for 
a survey of axiomatizability results of subreducts of RRA, i.e., classes of 
algebras where only a subset of the operations of RRA are present. In the 
last decade, researchers have continued to investigate subreducts of RRA, 
and significant results have been obtained. In this paper, we will mainly 
concentrate on subreducts of RRA in which the similarity type includes 
relation composition and at least one of the lattice operations of meet and 
join. We will recall the most important finite axiomatizability results and 
describe the so-called step-by-step method, that has been successfully 
used to achieve finite axiomatizations. Furthermore, we will show how 
to use ultraproduct constructions to establish non-finite axiomatizability 
results. In both cases, we will use games to make the constructions more 
intuitive. At the end of the paper we will mention some open problems. 

2 Basics 

First we recall the definition of representable relation algebras. 
Definition 2.1. A representable relation algebra is an algebra 
2t=(A,0,l, ^,1') 

such that A C p{W) (the powerset of W) for some equivalence relation 
W , 0 = 0,1 = W , ■ is intersection, + is union, — is complement with 
respect to W , ; is relation composition, "" is relation converse, and V is 
the identity relation. More formally, for all elements x,y E A, 

X ;y = {{u,v) eW : (u,w) E x and {w, v) E y for some w} 
x'~' = {(w,n) EW \ {v,u) E x} 

V = eW \ u = v}. 

The set W is the unit of 21, and the base of the algebra is the field of W , 
i.e., the smallest set U such that W C U x U. 

We denote the class of isomorphs of representable relation algebras 
by RRA. We also use ‘RRA’ as an abbreviation for the phrase ‘represen- 
table relation algebra’. 
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We will also consider classes of algebras with different similarity types 
than that of RRA. In general, we will say that an algebra is representable 
if it is isomorphic to a set algebra of binary relations. 

As we mentioned earlier, RRA is a non-finitely axiomatizable variety. 
It is an interesting problem to find out precisely which combinations of 
operations cause non-finite axiomatizability. This is one of the motiva- 
tions for investigating algebras with smaller similarity types. 

Definition 2.2. Let 21 G RRA and r be a set of operations whose el- 
ements are definable by fixed terms. The r-reduct of 21 is the algebra 

(A, o)o&T- 

The T-subreduct of RRA, R(r), is the class of subalgebras^ of the 
r-reducts of elements of RRA. 

When forming subreducts we will look at the following operations be- 
sides the primitive operations of the similarity type of RRA. The residu- 
als \ and / of ; are defined as follows: 

x\y = - - y) xjy= -{-x\ y'^). 

Their semantics is given by 

x\y = {{u,v) \ for all w, {w, u) e x implies (re, v) G y} 

x/y = G IL : for all w, (n, w) e y implies (m, w) G x}. 

The conjugates of ; are > and <. They are defined in RRA as 

x>y={x'~']y) X <y = {x-,y'^). 

Their meaning is 

X \> y = {{u,v) & W : for some w, {w, u) e x and (re, v) G y} 

X <y = {{u,v) eW : for some w, (n, w) e y and (m, w) G x}. 

While RRA is a variety, subreducts of RRA are not always closed 
under homomorphic images. However, it is not difficult to show that 
they are closed under forming subalgebras, products and ultraproducts. 
In other words, any subreduct of RRA is a quasivariety. Hence subreducts 

* As usual in algebraic logic, we assume that subalgebras include isomorphic copies as well. 
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of RRA can be defined by sets of quasiequations, i.e., universal formulas 
of the form 

(<Jl Qi A ■ ■ ■ A (Jjj (Jn) ^ ^0 i?0- 

The question is for which choice of r does there exist a finite set of 
quasiequations axiomatizing R(r). 

Remark 2.3. There is a subtle but important point worth mentioning 
here. Recall that, in the semantics of the extra-Boolean operations, the 
unit W of the algebra occurs as a parameter. Since we are talking about 
axiomatizing subreducts of RRA, an abstract algebra belongs to such a 
class if and only if it can be represented as an algebra of binary relations 
where the operations are computed in an equivalence relation. 

In certain cases, it is a natural question whether an algebra can be 
represented as a set algebra of binary relations where the operations are 
computed in a transitive relation, for example. The two problems are not 
equivalent in general; for different choices of W we might have different 
semantics of the extra-Boolean operations. As an example, see the case 
of semilattice-ordered residuated semigroups in the next section. 

3 Finite axiomatizability 

In this section we look at subreducts of RRA that can be axiomatized 
by finite sets of quasiequations. We will use the step-by-step method to 
establish finite axiomatizability. The origin of the method goes back to 
the early 20th century. The same idea has been used to prove model- 
theoretical results (e.g., Ldwenheim-Skolem theorems) and complete- 
ness theorems (e.g., Henkin’s completeness proof for first-order logic). 
In algebraic logic, it has been used to establish axiomatizability results, 
see for instance [Mad82], [AndiTho 288 ]; see also [HirHodi97a] which 
contains more historical remarks about the method used in other branches 
of logic as well. 



3.1 Semilattice-ordered semigroups 

First let us look at the subreduct R( • , ; ) whose similarity type consists 
of intersection ■ and composition ; . Clearly ■ satisfies the semilattice 
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axioms, viz. 

X ■ X = X 
X ■ y = y ■ X 
(x-y)-z = x-{y-z). 

As usual in semilattices, we define 

X <y X = X - y. 

It is easy to check that composition is an associative operation, i.e., 

{x-,y)]z = x]{y,z). 

Finally, let us observe that composition is a monotone operation with 
respect to the ordering <: 

{x < y Au < v) X ]u < y ]v. 

Thus { ■ , ; }-reducts of RRAs are (lower) semilattice-ordered semigroups. 
Bredikhin and Schein [BrcoSchiVS] showed that the converse holds as 
well. 

Theorem 3.1. Let E be the axioms for semilattice-ordered semigroups. 
Then we have 

21 e R( • , ; ) ^ a h S. 

Proof. To prove^ the theorem, we have to show that every semilattice- 
ordered semigroup can be represented over an equivalence relation, i.e., 
it is isomorphic to a set algebra of binary relations where composition is 
computed in an equivalence relation. We will build such a representation 
in a step-by-step manner. 

The main idea is to define a chain of partial representations such that 
the union of the chain defines the required isomorphism. The elements of 
the chain will satisfy certain coherency conditions that we will maintain 
during the construction. As we go along the chain we will approximate 
the structure of the abstract algebra to be represented by achieving a sat- 
uration property. 

We will define a chain of labelled graphs 



Go c ■ ■ ■ c c . . . 



^ The present proof is based on [AndiMik94]. 
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such that 

Gqi ( Uq^ , Eq^ , £q, ) , 

where Ua denotes the set of nodes, Ua x Ua is the set of edges, 

and ia ■ Ea ^ A is the labelling of the edges by elements of 21. We will 
maintain the following two coherency conditions: 

Ea is a transitive relation (1) 

for {x, y), {y, z) e E^„ z) < 4(a;, y) ; 4(i/, z). (2) 

We say that Ga is coherent if it satisfies the two coherency conditions (1) 
and (2) above. 

The graphs form a chain in the following sense: for a < (3, 

Ua C Uy, Ea C Ef), ia is the restriction of to Ea- 
Let G = {U, E,i) he the union of the chain: 

U = \JUa E = \jEa i = \Jia. 

a. a. a 

Clearly, G is coherent if all Ga are coherent. 

The saturation property we want to achieve is: for every a,b G A and 
(x,y) e E, 

y) < a;b implies i{x, z) = a, i{z, y) = b for some z e U. (3) 

Given a coherent and saturated graph G = (U,E,i), we define h : 
A — p{U X U) as follows: for every a e A, 

h{a) = {{x,y) & U X U : i{x,y) < a}. 

Let ® be the { ■ , ; }-reduct of the full RRA with unit U x U . We show 
that h is an embedding of 21 into !B. 

By the semilattice axioms and the definition of h, h is a homomor- 
phism with respect to ■ . Using the coherency condition (2), one easily 
shows that 

h{a ) ; h{b) C h{a ; b), 



while 



h{a ; b) C h{a ) ; h{b) 
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follows from the saturation property ( 3 ). Injectivity is guaranteed once 
we make sure that {{x,y) : i{x,y) < a} 7^ {(x,y) ■ i(x,y) < b} for 
distinct elements a and b of 21; see the initial step of the construction 
below. Then h is indeed the desired embedding. 

Thus it remains to construct the chain of graphs Ga- There are three 
cases in the construction according to whether a is 0, a successor, or limit 
ordinal. 

Initial step. In the base step of the construction, for every element a of 
the algebra 21, we take arbitrary distinct points Ua and Va and label the 
edge (mq, Va) by a. We choose these points such that they are distinct for 
different elements of the algebra, i.e., for a^b, {ua, Va} n {ub, Vb} = 0. 
We let Uo = {ua,Va : a e A}, Eo = {{ua,Va) : a e A} and 4 = 
{((Ma,Fa),a) : a e A}. The coherency conditions are trivially satisfied 
by Go = {Uo,Eo,io). Moreover, {{x,y) : io{x,y) = a} ^ {{x,y) : 
io{x,y) = b} for distinct elements a and 6 of 21 (thus h indeed is an 
injection). Finally, let Dq be an enumeration of all “defects”, i.e., those 
triples {{x, y), a, b) such that {x, y) G Eq, a,b e A and io{x, y) < a ;b. 

Successor step. Let us assume that we have defined a coherently labelled 
graph Ga = (Ua, Ea, ia)- Also assume that Da is an enumeration of all 
triples ((x, y), a, b) such that {x, y) G Ea, a,b e A and ia(x, y) < a; b. 
We define 

Ga+l i.Ua.\-l, Ea.\~l, da+l) 

as follows. Let ((x, y),a, b) be the first actual defect, i.e., the first element 
of Da such that there is no w G Ua such that la (x, w) = a and la {w,y) = 
b. We repair this defect as follows. Choose a fresh point z ^ Ua- We 
define 



Ua+l = UaA {z} 

Ea+i = {{x,z), {z,y)]VJ 

{(r, z) : (r, x) G Ea] U {{z, s) : {y, s) G Ea] 
la+i = 4 u {{{x,z),a), {{z,y),b)]U 

{((r, z),ia{r, x)]a) : (r, x) G Ea]U 
{{{z,s),b-,ia{y,s)) : {y,s) G Ea]. 



Clearly, Ea+i is transitive. Furthermore, using the axioms, it is routine to 
show that Ga+i satisfies the coherency condition (2). 
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Finally, we define D^+i by appending the new possible defects to the 
list Da- That is, we add the triples ((x, y), a, h) such that {x, y) G E^+i, 
a,b E A and (x,y) < a;b to the end of the enumeration Da. 

Limit step. Take the union of the chain of labelled graphs defined so far. 
Obviously, it satisfies the coherency conditions. Since we do not intro- 
duce any defect in the limit step, we can define the list of defects as the 
union of the previous lists. 

The construction continues as long as there are defects that we have not 
dealt with yet. Note that by the careful enumeration of possible defects, 
every defect is repaired sooner or later. See [AndiMik94] for a precise 
proof that the construction indeed terminates. Thus we can conclude that 
the final graph G is free from defects, i.e., it satisfies the saturation con- 
dition. q.e.d. 

Remark 3.2. 1. Note that the relation E constructed above is irreflex- 
ive and asymmetric, thus it is not an equivalence relation. However, 
the embedding h shows that the composition operation is computed 
in the equivalence relation U x U .In fact, using the similarity type 
{ ; , ■ } we cannot distinguish between representations using equiva- 
lence or transitive relations. The picture will drastically change when 
we include the residuals into the similarity type, see below. 

2. The coherency conditions have a universal flavor; these properties 
talk about all edges, and they have to be maintained during the whole 
construction. On the other hand, one might argue that the saturation 
condition is rather existential — it requires the existence of “witness- 
ing” edges providing the decomposition of the edge with label a ; b. 

One way to make this idea more explicit is to introduce games. There 
are two players, an existential player 3 and a universal player V. They 
start a game using an abstract algebra 21 — 3’s task is to show that 21 can 
be represented, while V tries to create obstacles. They build a chain of 
coherently labelled graphs. In the nth round of the game, V points out a 
defect, i.e., an edge {x, y) labelled with a ; b such that there are no edges 
{x, z) and {z, y) labelled by a and b, respectively. Then 3 tries to respond 
with an extension of the labelled graph containing the two labelled edges 
missing from the previous graph. Of course, 3 has to make sure that the 
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new graph is coherent — the coherency conditions are necessary require- 
ments for any (partial) representation. The universal player V wins the 
game if there is n < a; such that in the nth round 3 cannot respond with 
a coherently labelled graph. Otherwise 3 wins the game. 

This idea has been made explicit in the context of relation algebras, 
see [HirHodi97b]. Roughly speaking, a relation algebra is representable 
if and only if the existential player has a winning strategy in the repre- 
sentability game sketched above. In general, the existence of a winning 
strategy for 3 can be described by a set of first-order sentences that pro- 
vide an axiomatization for the class of algebras to be represented. Usually 
this axiomatization is infinite, though. 

3.2 Semilattice-ordered residuated semigroups 

Next we look at the possibility of extending the similarity type without 
losing finite axiomatizability. 

We mentioned earlier the residuals \ and / of composition. Their 
abstract characterization is given by the following two formulas: 

x]y < z i — >• y < x\z 
x]y < z i — >• X < zjy. 

For instance, the first one says that x\z is the largest element a such 
that x; a < 2 :. It is easy to check that the above two formulas are valid 
in representable algebras. It turns out that, together with the semilattice- 
ordered semigroup axioms S from Theorem 3.1, they are enough to en- 
sure representability over transitive relations, (cf. [AndiMik94]). 

Theorem 3.3. Every semilattice-ordered residuated semigroup can be 
represented as a set algebra of binary relations over a transitive relation. 

Proof. The proof of Theorem 3.3 is similar to that of Theorem 3.1. 
Again, one can define a chain of labelled graphs whose union defines 
the required representation of the abstract algebra. The coherency condi- 
tions are the same. But we require an additional saturation property, viz. 
for all a e A and x E U, there exist y,z eU such that 

i{y,x) = a i{x,z)=a. (4) 

This property can be achieved by adding the appropriate edges in the 
successor step a -f 1 of the construction. Of course, one has to label 
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the Other edges emerging when we make sure that Ea+i is transitive; 
this can be done in a similar way as above. Furthermore, the new graph 
has to satisfy the coherency conditions — a consequence of the axioms. 
Finally, one has to show that /i is a homomorphism with respect to the 
residuals as well. By definition of h, (u,v) G h{a\b) iff i{u,v) < a\b. 
The latter holds if and only if a ; i{u, v) < bhy the new axiom. Then 
we have that, for every 2 :, i{z, u) < a implies u ) ; ^{u, v) < b. The 
coherency condition (2) implies that, for every 2 :, i{z,u) < a implies 
^{z,v) < b. Hence (u,v) G h{a)\h{b). A similar argument using the 
saturation property (4) shows that h{a)\h{b) C h{a\b). Hence h is the 
required embedding. q.e.d. 



3.3 Axiomatizing R( • , ; , \? /) 

So far we proved a representation theorem for semilattice-ordered resid- 
uated semigroups over transitive relations. That is, we proved that the 
class of (isomorphs of) algebras of binary relations with set-theoretic 
operations of intersection, composition, and left and right residuals com- 
puted in a transitive relation is finitely axiomatizable. Consider now the 
problem of the finite axiomatizability of R( ■ , Recall that, in 

subreducts of RRA, the operations are computed in an equivalence re- 
lation. In other words, for axiomatizing R( • , ; , \, /) we need a represen- 
tation theorem with equivalence relations. The difference between the 
two axiomatizability problems is indicated by the following observation. 
In R( ■ , ; , /), the identity relation is always contained in a\a, while 

it clearly fails in the representation we gave in the proof above {E is 
irrefiexive). Additional axioms ensure that we can achieve finite axiom- 
atizability in this case as well. 

Theorem 3.4. R( ■ , ; , \, /) is finitely axiomatizable. 

The axioms are those for semilattice-ordered residuated semigroups to- 
gether with the following four additional axioms: 

X < y ^ z < z] {x\y) x < y ^ z < {x\y ) ; ^ 

X < y ^ z < z ] iy/x) x < y ^ z < iy/x ) ; 2 :. 

The proof goes along similar lines as for Theorem 3.3. However, it is 
more involved, since, in the constructed labelled graphs, Ea has to be not 
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just transitive but symmetric and reflexive as well. For details we refer 
the interested reader to [AndiMik94]. 

Of course, there are several other possible similarity types. We will look 
at adding union + or converse "" in the next section. Discussion of the 
identity constant is postponed until the last section. 

4 Non-finite axiomatizability 

In this section we look at classes of algebras that cannot be axiomatized 
by finite sets of quasiequations. We will also sketch a method using ultra- 
product constructions to establish non-finite axiomatizability (NFA, for 
short). This method dates back to the work of Tarski’s school of logic in 
the 1950s. 

The idea of establishing NFA of R(r) using ultraproducts is the fol- 
lowing. We will define Boolean algebras with extra operators of the RRA 
similarity type, 2t„ (n < u), and show that their r-reducts are not repre- 
sentable while an ultraproduct of the reducts is representable. By Los’ 
theorem [Hodo93, Theorem 9.5.1], this is enough to show that R(r) is 
not finitely axiomatizable in first-order logic. 

Now assume that r C r' and that R(r) is not finitely axiomatizable. 
What can we say about R(r')? There is no general theorem indicating 
that R(r') is not finitely axiomatizable, but this is often the case. If we 
can show that in fact an ultraproduct of 2t„ for n < cu (and not just an 
ultraproduct of their r-reducts) is representable, then we can conclude 
that no R(r') with r C r' can be finitely axiomatized. If this is the case, 
we will say that R(r) is hereditarily non-finitely axiomatizable (HNFA, 
for short). 

The following results show that including union -f or converse "" 
into the similarity type leads to non-finite axiomatizability. 

- R( + , ; ) is NFA, (cf. [Andi88]); 

- R(;,^)isNFA,(c/:[Breo77]); 

- R( -,+,;) is HNFA, (cf. [Andi91]); 

- R( ■ , ; , ^) is HNFA, (cf [HodiMikOO], [Hai91]); 

- R( • , ; , 1', >) and R( ■ , ; , 1', <) are HNFA, (cf [HodiMikOO]). 

Thus, while the subreduct to intersection and composition is finitely ax- 
iomatizable, if intersection is replaced by either union or converse, the 
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resulting subreduct is not finitely axiomatizable. If to intersection and 
composition we adjoin union or converse (or the identity relation and 
one of the conjugates), we get a hereditarily non-finitely axiomatizable 
class. The problem of HNFA in the first two cases seems to be open. 

Below we will show how to establish the HNFA of R( ■ , 0) us- 

ing a variant of the “rainbow” construction from [HodiMikOO]. This par- 
ticular similarity type will make the presentation of the idea slightly sim- 
pler. 

Theorem 4.1. R( • , ; , 0) is HNFA. 

Proof. We will define algebras (n < a;) of the RRA similarity type. 
Then we will show that (i) their { ■ , ; , 0} -reducts are not representa- 

ble, while (ii) their ultraproduct 21 is in RRA. 

We will show (i) by using the games mentioned in Remark 3.2. We 
will provide the universal player V with tools (the green atoms gi (0 < 
i < 2”) below) to gradually build a linearly ordered sequence. The alge- 
bra 2t„ is designed so that the existential player 3 has to respond with a 
similar linearly ordered sequence (using the red atoms (1 < i < 2”)) in 
any representation of 2t„. But V’s sequence is longer than the sequence 3 
can build (there are more green atoms than red). Hence there is a round in 
the game played on 2t„ when 3 is not able to respond to V’s move. Thus 
V can win the game, whence 2t„ cannot be represented. However, as n 
grows 3 can survive more and more rounds in the representability game, 
since it takes more and more rounds to show that V’s sequence is longer 
than 3’s. Then in the limit, i.e., in the game played on the ultraproduct 21, 
3 can survive infinitely many rounds. It follows that 21 G RRA, establish- 
ing (ii). 

We are ready to define the algebras. The construction is rather in- 
volved, since we have to make sure that 3 can survive several rounds, 
i.e., the reason for non-representability does not become immediate. Let 
n be any natural number. 2t„ has the following atoms (minimal non-zero 
elements): 

- identity: 1', 

- greens: gi {0 < i < 2"-), 

- whites: w, Wjj (0 < i < j < 2"-), 

- yellow: y, 

- black: b. 
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- reds: o (0 < i < 2”). 

The elements of the algebra are the sums of the atoms, i.e., all possible 
subsets of the set of atoms. Then the Boolean operations are defined as 
set-theoretic union, intersection and complementation. All elements x are 
self-converse, i.e., x'~' = x. Next we will define composition for atoms. 
Then composition for other elements of the algebra can be defined using 
additivity: 



x;y = E {u;v : u < x,v < y, u,v atoms}. 

The composition of two atoms x^ and xi is defined by specifying pre- 
cisely which atoms are below xq;xi, or equivalently, by specifying 
which atoms are not below xq ;xi. We say that a triple {xo,xi,X2) of 
atoms is a forbidden triangle if X2 is not one of the atoms below xq;xi, 
in symbols (xq ; xf ■ X 2 = 0. Since the ultraproduct of the algebras 2t„ is 
to be in RRA, we want to satisfy the so-called cycle law. Since every 
element is self-converse, it says that, for every permutation vr of 3, we 
have {xTr(o) ; a;,r(i)) ■ Xn{ 2 ) = 0 as well. In other words, if (xq, xi, 0 : 2 ) is a 
forbidden triangle, so is every permutation. The forbidden triangles (up 
to permutations) are precisely the following: 

(green,green,green) 

(yellow, yellow,yellow) 

(green,green,white) 

(yellow, yellow,black) 

(rj,rj,rfc) unless i + j = k or i + k = j or j + k = i 
(gi,gi+idj) unless j = 1 
(gi,y,Wjfc) unless i E {j, k}, 

where, e.g., (green,green,white) stands for: {g;g') -w = 0 for all green 
atoms g, g' and any white atom w. We also require that (!', x, y) and its 
permutations are forbidden for all distinct atoms x and y, since we want 

V -,x = X. 

As an example let us look at the composition of two green atoms with 
consecutive indices. We have 



gi ; gi+i = n + b -f y, 
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since the eomposition of two green atoms with eonseeutive indiees ean- 
not be white, green, the identity, or any red but ri . Note that the eompo- 
sition of the yellow atom with itself eannot be blaek or yellow. Thus we 
have 

(&;g*+i) ■ (y;y) = ri. 

Finally, let us look at 



(g*;gi+2fc)-(y;y)-(rfc;rfc) 

for some indiees i and /c > 0 sueh that i + 2k < 2”. We have already 
seen that the interseetion of the eomposition of green atoms and of the 
eomposition of yellow with itself eannot be green, white, blaek or yellow, 
and identity is ruled out by the different indiees on the green atoms; thus 
it must be a union of red atoms. But the only red atom below is r 2 k- 
Thus the value of the above displayed term must be r 2 k- 

Next we show that the { ■ , ; , 0}-reduet of is not represen- 
table as a set algebra of binary relations over an equivalenee relation. 

Lemma 4.2. For any n < cu, is not in RRA. Moreover, the 
{ ■ , ; , 0}-reduet of 2t„, is not representable either. 

Proof. We will show that the universal player V ean win the repre- 
sentability game played on 2l„. First let us reeall the eohereney eondi- 
tions for this game. Let us assume that we have an edge (x, y) labelled 
by a and let / be a finite index set sueh that, for eaeh i E I, the edges 
{x, Zi) and {zi, y) are labelled by bi and Cj, respeetively. Then by the see- 
ond eohereney eondition (ef. (2) in the proof of Theorem 3.1, a < &* ; c* 
for eaeh i e I. Henee a < 'i must hold as well. Sinee, we are 

eonsidering a subreduct of RRA, the first eohereney eondition (ef. (1)) 
says that the labelled graph must be an equivalenee relation. Sinee eon- 
verse is in the similarity type, if an edge {x, y) is labelled by a, then the 
eonverse edge (y, x) must be labelled by a~' . Sinee a = a'~', the direetion 
of the edges will not matter; 

the label of {x, y) must be the same as the label of {y, x). (5) 

Furthermore, all reflexive edges must be labelled — this ean be done us- 
ing the identity atom 1'. We will not bother mentioning this in the future. 
Consequently, in the pieture illustrating the argument below. Figure 1, 
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we do not indicate the orientation of the edges, and the reflexive edges 
are not shown. 

The game starts with an edge {u, v) labelled by w 7^ 0 . In the initial 
round of the game, V points out that w < go ; y. Thus 3 has to create edges 
(m, Mo) and (mq, v) labelled by go and y, respectively. Since w < gj ; y for 
all i < 2”, in the ith round of the game V can force the existence of Ui 
{i < 2”) such that (u,Ui) is labelled by g* and (ui,v) is labelled by y. 
Note that by ( 5 ), («*,«) must be gj and (n, Ui) must be y. 

Now let us have a look at how 3 can respond to the above challenges 
of V. In the first round of the game, 3 has to label the edge {uq, ui). Since 
(go ; gi) ■ (y ; y) = the coherency requirement on the graph says that 3 
must label {uq, ui) by ri. The same argument shows that, in the {i + l)th 
round, 3 has to use ri to color the edge between Ui and Ui+i (for i < 2 "^). 
Indeed, (gj ; gj+i) ■ (y ; y) = ri for every i < 2 ”, whence the color of 
(wj, Mj+i) must be ii. 

So far so good, but 3 has to label all the edges (ui,Uj) for 0 < 
i,j < 2 "^. In particular, in the second round, 3 must label {uo,U2). By 
(go ; g2) ■ (y ; y) ■ (ri ; ri) = ra, 3 is forced to use ra to color (mq, U2). 
In general, we have that (g* ; gj+a) ■ (y ; y) ■ (ri ; ii) = ra determines the 
color of {ui, Ui+2) to be ra. By induction, we get that {uk, Mfc+20 must be 
labelled by ra*, since (gfc ; g^+aO ' (y ; y) ■ (i'2*-i ; ra*-i) = In particu- 
lar, the label of (mo;^2"-i) nnd (wa^-1,^2") is In the 2"th round, 
3 has to find a non- zero label for {uq, M2")- But this is impossible, since 
(go ; g2") ■ (y ; y) ■ (i2"-i ; ^a"-!) = 0. Thus 3 cannot respond with a co- 
herent graph in the 2*^th round. 

See Figure 1 for an illustration of the argument; the solid lines repre- 
sent V’s moves and the dotted lines show 3 ’s responses. q.e.d. 

One might wonder why we made the definition of 2 t„ so complicated. 
Why not simply use red and green atoms and require that, say, the com- 
position of green atoms must be red, gj ; gj+i = ri, and (r* ; r^) ■ = 0 

whenever the sum of two indices is different from the third one? It is not 
difficult to see that this would imply that go ; ga»*-i = g2»-i ; g2" = i'2"-i- 
Then V could win the game just in two rounds independently of the value 
of n. Thus the ultraproduct would not be representable either. 

Let i and j be different, non-consecutive indices. In our construction, 
gj ; gj is the sum of y, b and all the red atoms. In addition to the green 
atoms, V has to use yellow to force 3 to use reds: y ; y cannot be yellow 
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or black. Furthermore, 3 has some freedom in choosing the index of the 
red atom — the choice is constrained by the forbidden red triangles, i.e., 
by the indices of the red atoms used so far in the game. 

In the proof of Lemma 4.2, it was 3’s responsibility to color the (red) 
edges {ui, Uj). Would not it be faster for V to win the game by choosing 
some of these red colors? For instance, V might start the game with an 
edge {uq,U 2 ) and label it with rs. Then V would force 3 to extend the 
graph by pointing out that rs < go ; g 2 and that rs < y ; y. Then we would 
have edges (mo,m) e go, (m,M 2 ) e g 2 , {uo,v) G y and {v,U 2 ) G y. If V 
could force 3 to label new edges {uq,ui) and (mi,M 2 ) by ri like in the 
above proof (by requiring that (u,ui) be gi and (ui,v) be y), it would 
be the end for 3 (since (ri ; ri) ■ ro = 0). This is where the indexed white 
atoms Wij come into the picture. In the present scenario, 3 is to choose 
the color of ( m , v), and wo 2 is a possibility. Recall that (g^ ; y) ■ Wjj = 0 
whenever k ^ {*,?}• Hence if 3 labels (u,v) by wo 2 , then V cannot 
require the existence of Ui such that (m, ui) be labelled by gi and (mi, n) 
by y. 

Looking at 3’s strategy in general, it should be obvious that it is not 
a clever idea to use red atoms (unless it is unavoidable) and that using 
green and yellow atoms is not advisable either (since they can force later 
edges to be red). When the graph has to be extended, 3’s strategy is to 
use an indexed white whenever this is a possible choice. The black atom 
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gives 3 another option to avoid using red, green or yellow; when V does 
not use yellow and green for extending the graph and no indexed white 
is available, 3 uses blaek. Finally, 3 uses a red atom only if neither white 
nor blaek ean be used. 3 ehooses the index of the red atom depending on 
the number of rounds played so far and on the indiees of the red atoms 
oeeurring in the graph; cf. [HodiMikOO] for details. 

Thus the blaek and the indexed white atoms give 3 some leeway to 
postpone losing the game. In faet, the only way V ean win the game is 
to play a sequenee of green and yellow atoms as deseribed in the proof 
of Lemma 4.2. Furthermore, a win for V requires more and more rounds 
as n grows. When the two players play the game on the ultraproduet 21 
of the algebras 2t„ (n < cu), then 3 ean survive infinitely many rounds 
and thus has a winning strategy. Henee 21 is representable as an RRA. We 
ean eonelude that R( • , ; , 0) is HNFA. We refer the interested reader 
to [HodiMikOO] for a preeise proof. q.e.d. 

Remark 4.3. 1 . It is not very diffieult to show that the same eonstrue- 
tion ean be used to establish that the { ■ , >, <l}-reduet of 2t„ is not re- 
presentable using an equivalenee relation. Henee we have that 
R( •,>,<) is HNFA.3 

2. A related issue to finitely axiomatizing a elass K of algebras is the 
problem whether the equational theory of K is finitely axiomatizable 
in equational logie. We will not pursue this issue here; the interested 
reader is referred to [HodiMikOO] where it is shown that, for all fi- 
nite n, there is a valid equation that fails in the nth reduet algebra, 
establishing that the equational theory is not finitely axiomatizable 
either. 

5 Open problems 

So far the general, if somewhat simplified, pieture is that the interaetions 
between eomposition ; and union -f or eonverse "" lead to non-finite 
axiomatizability. But what ean we say about the interaetion between ; , 
■ and identity 1'? 



^ A joint result with I. Hodkinson. 
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Identity. The most obvious candidate as an axiom for the identity con- 
stant is 

x ; 1 = 1 ]x = X. 

But it is not very hard to show that it is not enough for axiomatizing 
R( ■ , ; , 1') over R( • , ; ).^ In fact, one of the most challenging open prob- 
lems in this field is the following.^ 

Question 3. Is R( ■ , ; , 1') finitely axiomatizable? 

A related question is whether R(;, - admits a finite axiomati- 

zation. In [AndiMik94] a representability result was established for the 
specific case in which the identity constant 1' satisfies the axiom 

X -.,y < l' ^ {x = \/ y = \/ X = y = l') 

as well. However, this axiom is usually not valid, hence the general finite 
axiomatizability question remains open. 

Conjugates. In Remark 4.3, we mentioned the HNFA of R( • , >, <l). An 
alternative would be if we allow representations over transitive (not nec- 
essarily equivalence) relations. Is this class of algebras finitely axiom- 
atizable? It can be shown that the class of algebras representable over 
transitive relations of similarity type { ■ , ;,>,<} is HNFA [Mik02]. It 
would be interesting to find the borderline precisely where we lose finite 
axiomatizability. 

Complexity. It is well known that the equational theory of RRA is unde- 
cidable [Tar41]. On the other hand, the equational theory of the positive 
fragment of RRA, R(l, 0, • , -f , ; , 1'), is decidable [AndiBreo94], but 

not much is known about the exact complexity of specific subreducts. As 
an example we mention that it follows from [Lam58] that the equational 
theory of R( • , ; , \, /) is decidable in exponential time, but no matching 
lower bound has been found so far. 



There are semilattice-ordered monoids which are not representable because they fail to vali- 
date the formula (x \ y) ■ V = tt ^ {y \ x) ■ 1' = Q, for instance. 

^ This problem was mentioned by B. M. Schein at the Algebraic Logic Colloquium in Budapest 
in 1988. 
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Finite base property. The step-by-step construction usually yields a rep- 
resentation on an infinite base. It would be interesting to find out which 
(classes of) algebras can be finitely represented. There are results about 
finite representability in certain cases, cf. [AndiHodiNem99], but it is 
not clear if existing techniques can be applied to algebras with transitive 
units. A related issue is when it is enough to consider algebras with finite 
bases to refute non- valid formulas. Let us mention a specific problem 
[vBe91,p.349]. 

Question 4. Is it the case that any non-valid equation of R( ■ , ; , \, /) 
can be refuted in an algebra represented on di finite equivalence relation? 
The same question applies when we consider algebras represented over 
transitive relations. 
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presence of regularity properties for projective sets of reals. 

The new result shown in this paper says that ZFC + the bounded proper forc- 
ing axiom (BPFA) + “every projective set of reals is Lebesgue measurable” is 
equiconsistent with ZFC + “there is a Ei reflecting cardinal above a remarkable 
cardinal.” 



1 Introduction. 

The current paper is in the tradition of the following result. 

Theorem 1.1. The theory ZFC + Martin’s axiom (MA) + “every pro- 
jective set of reals is Lebesgue measurable” is equiconsistent to the exis- 
tence of a weakly compact cardinal. [HaroShe85, Theorem B] 
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This theorem links a large cardinal concept with a forcing axiom 
(MA) and a regularity property for the projective sets of reals. We shall 
be interested in eonsidering foreing axioms whieh are somewhat stronger 
than MA. In partieular, we shall produee the following new result.^ 

Theorem 1.2. Equieonsistent are: 

1 . ZFC + “there is a Si refleeting eardinal above a remarkable eardinal”, 
and 

2. ZFC + the bounded proper foreing axiom (BPFA) + “every projeetive 
set of reals is Lebesgue measurable”. 

We don’t expeet the reader to be familiar with the eoneepts appearing 
in the previous theorems, as we are going to explain them here. We do 
have to presuppose, though, a basie understanding of forcing and con- 
structibility theory, this material is eovered by the text books [Jee78] and 
[Kun80]. 

Large eardinals are ubiquitous in set theory. In [HauI4], Hausdorff 
had asked whether there is a regular fixed point of the aleph funetion 
i I— > Kj. We know today that the existenee of sueh a fixed point eannot 
be shown in ZFC, beeause if n is regular and n = then |= ZFC. 
(Here and in what follows we assume that ZFC is eonsistent.) We may 
say that a formula p{—) defines a large eardinal eoneept if ZFC proves 
that yK{p{K) ^ Lk 1= ZFC). A large eardinal axiom is a statement of 
the form 3n p{k) for some large eardinal eoneept given by p{—). Any 
large eardinal axiom is thereby independent from ZFC. 

Foreing axioms are also independent from ZFC. The prototype of all 
foreing axioms, Martin’s axiom, was isolated from the Solovay-Tennen- 
baum proof of the eonsisteney of the non-existenee of Suslin trees on oji 
(cf [Jee78, Theorem 50] or [Kun80, II, §4]). A foreing axiom asserts the 
existenee of reasonably generie filters for posets from a fixed elass of 
foreings. One may often view foreing axioms as a subsitute for CFI (the 
eontinuum hypothesis) in eontexts where CH fails. 

Projeetive sets of reals are primarily studied in deseriptive set theory. 
A projeetive set eomes from a elosed set via a finite proeess of taking 
eomplements and projeetions (cf [Jee78, p. 500]). The theory ZFC typ- 
ieally deeides questions about “simple” projeetive sets, but it leaves the 

* This theorem was first presented in a talk at the 6ieme Atelier International de Theorie des 
Ensembles in Luminy, France, on Sept 20, 2000. 
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arbitrary case open. For instance, every analytic set of reals is Lebesgue 
measurable in ZFC, whereas the Lebesgue measurability of all A 2 sets 
of reals is independent from ZFC. Nevertheless, descriptive set theorists 
believe it is true that every projective set of reals is Lebesgue measurable. 

We are now going to study the Lebesgue measurability of all projec- 
tive sets of reals in the context of forcing axioms. 

Set theorists obtain insight into the mathematical structure provided 
by various hypotheses by determining their consistency strength. We’ll 
write Con(S) ^ Con (S') for the statement that the consistency of the 
theory S implies that of S'. We say that S and S' are equiconsistent just 
in case Con(S) Con (S'). The consistency strength (or, large cardinal 
strength) of a given hypothesis is that large cardinal axiom (p{n) 
such that ZFC + 'F is equiconsistent with ZFC + 3k (p{k). This paper 
can also be read as an introduction to a method for proving equiconsis- 
tency results. Typically, the direction Con(ZFC + ^) ^ Con(ZFC + 
3k (p{k)) uses constructibility theory, whereas Con(ZFC + 3k (p{k)) 
Con(ZFC + \F) is an application of the method of forcing. 

2 MA and the Harrington-Shelah theorem. 

Let P be a poset, and let 2) be a collection of sets such that every D E D 
is a subset of P. We say that a filter F C P is T)-generic just in case 
F f] D ^ 0 for every D E T). D CF is predense (in P) if every p G P 
is compatible with some q E D. D CF is dense (in P) if for every p G P 
there is some stronger q E D. If D is the collection of all dense subsets 
of P then a ©-generic filter cannot exist, unless P contains atoms. On 
the other hand, if D is countable then a ©-generic filter always exists. In 
particular, if M is a countable transitive model of ZFC with P G M and 
if © G M is the collection of all the dense subsets of P which exist in 
M then there is a ©-generic filter; it is one of the key building blocks of 
the theory of forcing that such a filter G can then be “adjoined” to M to 
produce another model of ZFC, M[G]. 

The approach of forcing axioms is to ask for more than countably 
many dense sets to be met at once, where P will have to belong to a 
specific class of posets. In what follows we shall focus on the case that 
no more than Ki many dense sets are to be met. 
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Definition 2.1. Let T C V be a class of posets. We say that MA(r) holds 
if for every P G F and for every collection D of sets such that 

• Card(D) < Ki, and 

• D is dense in P for every D G D 

there is some D -generic filter. 

Let c.c.c. denote the class of all posets which have the countable 
chain condition. We take Martin’s axiom, which is abbreviated by MA 
and which was mentioned in Theorem 1.1, to say that MA(c.c.c.) holds. 
In particular, MA -as we define it- implies that > K 2 (i.e., that the 
continuum hypothesis, CH, is false). 

The reader may consult [Jec78, §§22, 23, 44] or [KunSO, Chapters II 
and VII] on MA. One can construe MA as a generalization of the Baire 
category theorem (cf. [KunSO, Theorem II. 2. 22]). The reader might also 
enjoy the discussion of the plenty consequences of M A that can be found 
in [Fre84]. 

It can be shown that Con(ZFC) implies Con(ZFC + MA) (cf. the 
paragraph right after Theorem 2.3 below). If > K 2 then one can 
require k many dense sets to be met for any k < 2^°, but it is inconsistent 
to require 2^° many dense sets to be met. 

By [KunSO, Theorem II. 3. 4], MA is equivalent to the fact that for 
every P G c.c.c. which is of size < Ki and for every collection D of sets 
such that Card(S}) < Ki and D is dense in P for every G S) there 
is some D -generic filter. This readily implies that MA is equivalent to 
BMA(c.c.c.) according to the following definition. 

Definition 2.2. Let F C V be a class of posets. We say that BMA(F) 
holds if for every P G F and for every collection S) of sets such that 

• Card(S}) < Ki, and 

• D is predense in P and has size < Ki for every D E D 
there is some D -generic filter. 

In order to uncover the relation of MA with the Lebesgue measura- 
bility of projective sets of reals we need a first large cardinal concept. We 
say that a cardinal n is inaccessible ii k > Kq, k is regular, and 2"^ < k 
for all cardinals 7 < k . We say that Ui is inaccessible to the reals if 
is countable for every real x. Notice that if oji is inaccessible to the reals 
then oji is an inaccessible cardinal from the point of view of L. 
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Theorem 2.3. If every projective set of reals is Lebesgue measurable 
then oji is inaccessible to the reals. In fact, if every Eg set of reals is 
Lebesgue measurable then ui is inaccessible to the reals. [She84] 

There is a poset P G c.c.c. such that MA provably holds in 
(cf. [Jec78, §23] or [Kun80, VIII. §6]). Let us assume that V = L. We 
shall then have that uY = ojY = and we may apply Theorem 2.3 
inside to get that in there is a projective set of reals which is not 
Lebesgue measurable. In particular, M A does not imply that all projective 
sets of reals are Lebesgue measurable: 

Corollary 2.4. Equiconsistent are: 

1. ZFC, and 

2. ZFC + M A + “there is a projective set of reals which is not Lebesgue 
measurable”. 

We now want to approach Theorem 1.1. A cardinal k is called weakly 
compact if k, is inaccessible and there is no Aronszajn tree on k, i.e., for 
every tree T of height k such that each level of T is of size < k there 
is a branch through T of length k. Equivalently, k is weakly compact if 
K is nj indescribable in that for every A C k and for every 111 formula 
), if Vk +1 1= 97(A) then there is some \ < k with V^+i |= fl A) 
(cf. [Jec78, p. 386]). The proof of Con(2) ^ Con(l) needs an improve- 
ment of Theorem 2.3 in the presence of M A. 

Lemma 2.5. Assume that MA holds. If Ki is inaccessible to the reals 
then in fact Ki is weakly compact in L. [HaroShe85] 

Proof. Eet us suppose that MA holds and that Ki is not weakly compact 
in L. We shall produce a real x such that = uji. 

As Ki is not weakly compact in L, there is a tree T G L such that (in 
V!) T is an Aronszajn tree on uji (this is due to Silver, cf. [HaroShe85, 
Claim 5]). We may and shall assume that T has infinitely many nodes of 
height 0. Eor a sequence d = (di\ i < c<;i) of subsets of uj we define P(d) 
by setting p G F(d) iff p is a finite order preserving partial function from 
T into Q (the rationals) such that if a G dom(p) has height u ■ iinT and 
p(a) G N C Q then p(a) G dj. We set p <p(/) g iff p D q. It is not hard 

to verify that P(ci) G c.c.c. 
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Now let h = {hi \ i < c<;i) be such that bi uj and > i for each 
i < oji.li suffices to show that there is some c C cu with b G L[c]. For 
i < oj I let us write T for the restriction of T to nodes of height < i. 

We shall define 6” = (6”: i < oii) and F„ by induction own < u. 
Let 6° = bi for i < uji. Given 6”, let be, by an application of MA, a 
S)„-generic filter on P(6*^), where Dn is some appropriate collection of 
dense sets with Card(Dn) = such that the following will hold true: 

(1) „ m e 6” iff there is an a of height u ■ iinT with F„(a) = m, and 

(2) n Fn is continuous at limit ordinals. 

Let 6”+^ C a; be a canonical code for Fn\{T fa; ■ (i + 1)), where i < uji. 
Finally, let c C o; be a code for (6q : n < uj). 

It is now easy to verify that n < uj,i < uji) G L[c]: Let i < uJi, 
and suppose that we already know n < uj,j < i). As codes 
Fn\{T fa; ■ (j + 1)) for every j < i, we can thus compute f(T fa; ■ i). 
But {2)n then gives us f(T fo;-i + l), and hence by (1)„ we can compute 
6”. q.e.d. 

Let us now turn towards Con(l) ^ Con(2). The key concept here is 
L(M) -absoluteness for a class of posets. Let F C V be a class of posets. 
We say that L(M) -absoluteness for F holds (in V) iff for all P G F we 
have that the first order theories (with names for reals from V) of L(M) 
and agree with each other. 

Lemma 2.6. Let F C V be a definable class of posets. Let n be an 
inaccessible cardinal, and suppose that for all P G ^ 

reals x G we have that there is some poset Q G with 

X G V®. Then L(M) -absoluteness for F holds in 

Proof. Let G be Col(o;,< /t)-generic over V, and let H be P'^-generic 
over V[G]. Finally, let E be Col(o;, (2^°)^['^][^])-generic over V[G][Lf]. 
Let(ei: i < u) e \[G][H][E] be such that {e^ : i < ca} = Mn V[G][i/]. 
By working inside VfG][iF][i?] we may use the hypothesis of Lemma 
2.6 to construct (o;*, Gj : i < u) such that for all i < u we shall have 
that ai < cii+i < K, Gi is Col(o;,< o;j)-generic over V, Gj_i C Gi 
(with the convention that G_i = 0), Gi G V[G][iT], and G V[Gj]. 
Set G' = IJ. Gi. Because Col(o;,< k) has the k-c.c., G' is Col(o;,< k)- 
generic over V, and every real in V[G'] is in VfGj] for some i < u. We 
thus get that M n V[G'] = M n V[G] [H]. 
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This yields that the first order theories of and 

agree with each other: because Col(o;,< k) is homogeneous (cf. [Jec78, 
Theorem 63]), we get that for any formula (p{v) and for any T G M fl 
V[G] we have that |= (p{x) iff II“co 1 (L,<k) “L(M) |= iff 

L(MV[G][H]) ^ ’ q.e.d. 

Theorem 2.7. Let k be inaccessible. Then in we have that 

every projective set of reals is Lebesgue measurable. [SolVO] 

Corollary 2.8. Let T C V be a definable class of posets. Suppose that k 
is an inaccessible cardinal and is a statement such that 

(a) for all P G ^ ^ YCo1(oj,<k)*p 

there is some poset Q G V«; with x G V®, 

(b) (f holds in and 

(c) provably in ZFC + (p, there is some P G T such that MA(T) holds in 

VP_ 

There is then a poset § such that in we have that: MA(r) holds and 
all projective sets of reals are Lebesgue measurable. 

The same result holds with “ MA(T)” being replaced by “ BMA(T).” 

Proof. We set § = Col(cu,< k) ^ P for some P which is provided by (b) 
and an application of (c) inside The result follows by Lemma 

2.6 and Theorem 2.7, via (a). q.e.d. 

We are now in a position to be able to give a proof for Con(2) ^ 
Con(l) of Theorem 1.1. As it is provable in ZFC that there is a poset 
P G c.c.c. such that MA holds in V’'', we may apply Corollary 2.8 with p) 
being some logical truth, say. It remains to see that (a) of Corollary 2.8 
holds for r = c.c.c. and for k being weakly compact: 

Lemma 2.9 (Kunen). Let k be weakly compact. Suppose that 

YCo1(o;,<«) ^ up -g ^ pgggj jj^g g g_g_„_ 

Then for all reals x G we have that there is some poset 

Q G Vk with X G V®. [HaroShe85] 
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Proof. It is easy to verify that Col(o;,< k) a P g c.c.c. Let us work with 
Boolean algebras, and set 23 = r.o.(Col(cu,< k) a P). It suffices to show 
that any X C of size < k, generates a complete subalgebra of ® of 
size < K. 

Well, there is clearly a complete subalgebra ® of 23 with X C ® C 
23 and Card(?B) < k. W.l.o.g., ^ C k. Let A C be the set of all 
maximal antichains of As k is 11} indescribable, there is some \ < k 
such that X C A,?BnAisa< A-complete Boolean algebra, and A n [A] ^ 
is the set of all maximal antichains of fl A. But then ® fl A is a complete 
Boolean algebra, and hence a complete subalgebra of !B. q.e.d. 

3 PFA and projective sets. 

If X is a set then [X]"^ denotes the set of all subsets of X of size Kq. 
Let q; > Ki. A set S' C [aY is called stationary iff for all models = 
(a; . . .) of finite type and with universe a there is some (X; . . .) ^ 971 
such that X G S'. A poset P is called proper if for all a > Ki, whenever 
S' C [aY is stationary in [aY then 

V'*' 1= S is stationary. 

We let proper denote the class of all posets which are proper. The proper 
forcing axiom, abbreviated by PFA, says that MA(proper) holds. 

The reader may find information on PFA in [Jec86] and [Bau84]. In 
particular, if there is a supercompact cardinal then there is a proper poset 
P such that PFA holds in V''' (cf. [Jec86, Theorem 6.2]). 

In this section we shall see that PFA implies that all projective sets of 
reals are Lebesgue measurable. It is not known how to prove this directly, 
though. We have to show a transfer theorem of the following form: if 
PFA holds then there are certain inner models for such-and-such large 
cardinals, which in turn implies Lebesgue measurability for projective 
sets. The argument will in fact give that PFA implies much more, namely 
Projective Determinacy. 

Here is the key concept which links PFA with inner model theory. 
Let K be an infinite cardinal. We say that holds if there is a sequence 
(Q: K < i < A lim(f)) such that each Q is a closed unbounded 
subset of i of order type < k and Cj = C* fl j whenever j is a limit 
point of i. The principle was isolated by Jensen in the context of his 
seminal work on Godel’s constructible universe. 
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Lemma 3.1. PFA implies for all k > Ki. [Tod84] 

Proof. Let T be the set of all limit ordinals i with n < i < . Suppose 

{Ci \ i eT) witnesses that □«; holds. For j <ie T let us write j -< i iff 
j is a limit point of C* (iff j < i and Cj = C* fl j). Then (T, -<) is a tree, 
which we shall denote by T. 

Set P = {p C : p is closed and countable}, ordered by end- 
extension. Notice that P is cu-closed, and forcing with P adds a closed 
unbounded subset of of order type cui. The proof of the following 
Claim makes use of all the properties of our DK-sequence. 

Claim 6. In V'*', there is no branch through Tl" IJ G of order type cui. 

Proof of Claim. Suppose that p G P forces that there is some such 
branch. Notice that P x P is also cu-closed. Let Go x Gi be P x P generic 
over V with p G GofiGi.Letfe^ G V[Gft] be a branch through T|'1JG'^'‘ 
of order type cui, for h G {0, 1}. Let ^ ^ !}• 

^jv[GoxGi]('^+^ _ straightforward to see that we must have 

= A^. Therefore, A^ G V. But then we must have otp^(A°) = 
and therefore there is some i G T with otp(Gj) > k. Contradiction! 

q.e.d. (Claim) 

Inside there is hence a forcing Q for adding, with finite con- 
ditions, a specializing function / for TfljG, i.e., an order preserving 
/; TI'IJG — Q (cf. the forcing P(cT) in the proof of Lemma 2.5). A 
A-system argument shows that Q has the c.c.c. In particular, the iteration 
P X Q is proper. 

For k < uji, let Dk be the set of all (p, g) G P x Q such that otp(p) > 
k + 1 and if i is the kth. element of p then (p, q) decides f{i). Each Dk is 
dense. Let D = {Dk'. k < cui}. 

By PFA, let {Hq, Hi) be a 2D-generic filter. Let i = sup(lj Hq), and 
let / be the specializing function for TflJ Hq which is provided by Hi. 
Then / witnesses that T|' IJ LTo cannot have a branch of order type ui. 
However, G* fl IJ LTo is exactly one such branch. Contradiction! q.e.d. 

The following two results were products of Jensen’s fine structural 
analysis of L. 

Lemma 3.2 (Jensen). V=L implies for all k > Ki. [Jen72] 
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We say that 0^ exists if there is an elementary embedding tt : L ^ L 
with 7T ^ id. The existenee of 0^ is a large eardinal axiom in that if 
7T : L ^ L has eritieal point k then |= ZFC. 

Lemma 3.3 (Jensen). Suppose that 0^ does not exist. If k is a singular 
eardinal then .In faet, if X C L then there is a set F G L with 

Y D X and Card(F) < Card(X) + Ki. [DevJenVS] 

Corollary 3.4. PFA implies that 0"^ exists. 

Proof. Assume that PFA holds. Set k, = By Lemma 3.1 we know 
that Dk fails. On the other hand, L |= V = L, and therefore we have that 
L 1= Dk by Lemma 3.2. Let (C* : k < i < lim(i)) G L be a witness. 
By Lemma 3.3, and therefore (C*: k < i < K+,lim(i)) in 

faet witnesses that holds in V. Contradietion! q.e.d. 

We may look at the previous proof from a more abstraet point of view. 
It tells us that PFA implies that there ean be no pair (k, W) sueh that k is 
a eardinal, W is an inner model, , and W |= □«;. However, the 

existenee of sueh pairs (k, W) ean be shown from weaker anti large ear- 
dinal hypotheses than the non-existenee of 0^. Sueh an analysis leads to 
the following remarkable theorem, via work of Martin, Mitehell, Sehim- 
merling. Steel, and Woodin. In its statement, PD denotes Projeetive De- 
terminaey (cf. [Jee78] p. 560). We refer the reader to [Kano94, Chapter 
6] for a thorough and nieely written diseussion of the signifieanee of PD 
in modem set theory. 

Theorem 3.5 (Martin, Mitchell, Schimmerling, Steel, Todorcevic, 
Woodin). PFA implies PD. 

Proof. The basie idea, whieh is due to Woodin, is to prove by induetion 
on n < cu that V is elosed under the operator x M*( x) for all n < oj. 
Here, M* ( x) denotes a suffieiently iterable sharp for an inner model with 
n Woodin eardinals whieh eontains x. The reader may find the details of 
sueh an induetion in the paper [ForiMagSeh201]. The point is that if 
n < OJ is sueh that V is elosed under x i— > M*( x) but M^_^;^(a;o) does 
not exist for some Xq then there is a eonstruetible inner model L[i?] sueh 
that and \j[E] |= for all large enough singular eardinals 

K. This gives a eontradietion with PFA via Lemma 3.1. q.e.d. 
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A classical result of Mycielski and Swierczkowski yields that PD im- 
plies that every projeetive set of reals is Lebesgue measurable (cf. [Jee78, 
Theorem 102 (a)]). We therefore finally established the following. 

Corollary 3.6. PFA implies that every projeetive set of reals is Lebesgue 
measurable. 

We do not know whether there is a proof of Corollary 3.6 whieh does 
not go through PD. 

We let stap denote the elass of all posets P whieh are stationary pre- 
serving, i.e., sueh that for all S' C cci we have that 

if S is stationary, then |= “5 is stationary”. 

Every proper foreing is elearly stationary preserving, but the eonverse is 
false. 

The papers [ForiMagShe88a] and [ForiMagShe88a] introduced Mar- 
tin’s maximum, abbreviated by MM, as MA(stap). Obviously, MM im- 
plies PFA. We refer the reader to [Jee86, III §7] or [ForiMagShe88a] and 
[ForiMagShe88b] for further information on MM. 

4 A few remarks on OCA. 

Fet X be a set. By [X]^ we denote the set of all {x, y} C X with x ^ 
y. Fet [X]^ = KqU Ki with Xq fl Xi = 0; Kq, Ki is then ealled a 
eolouring of [X]^. For h G {0, 1} we say that X C X is h-homogeneous 
iff [X]2 C Kh. 

Now let X C M. We view X as a topologieal spaee with the topology 
being indueed by that of M. The set [X]^ inherits a topology from X (the 
produet topology). We say that OCA(X) holds iff for every eolouring Kq, 
Ki of [X]^ sueh that Kq is open in [X]^, either there is an uneountable 0- 
homogeneous X C X or else X is a eountable union of 1 -homogeneous 
sets. The Open Colouring Axiom, abbreviated by OCA, is the statement 
that OCA(X) holds for every X C M. The reader may find baekground 
information about OCA in [Tod89, Chapter 8] and [FaroTod93, Chapter 
10 ]. 

Lemma 4.1. PFA implies OCA. [Tod89, Theorem 8.0] 
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Proof. LetX C M, and let Kq, Ki be a eolouring of Let us suppose 
that X is not the union of eountably many 1-homogeneous sets. Then in 
YCo 1 (o;,r)^ X is still not the union of eountably many 1-homogeneous 
sets. By [Tod89, Theorem 4.4], in there is therefore some un- 

eountable Y C X sueh that 



F={pYY : Card(p) < Kq, [p]^ C Kq}, 

ordered by reverse inelusion, has the e.e.e. Now Col(cu, M) a P G proper, 
and an applieation of PFA yields an uneountable 0-homogeneous subset 

of X. q.e.d. 

It ean be shown, however, that Con(ZFC) implies Con(ZFC + OCA). 
In partieular, OCA eannot imply that every projeetive set of reals is Lebes- 
gue measurable, by Theorem 2.3. 

We say that OCA*(X) holds iff for every eolouring Kq, Ki of [X]^ 
sueh that Xq is open in [X]^, either there is a perfeet 0-homogeneous Y C 
X or else X is a eountable union of 1-homogeneous sets. For all X C 
M, OCA*(X) implies that X has the perfeet subset property; therefore, 
OCA*(X) eannot hold for all X CM (under the axiom of ehoiee, that 
is). 

Theorem 4.2. Projeetive Determinaey PD implies OCA*(X) for every 
projeetive X C M. [Feno83] 

The following eorollary is thus another example of a transfer theo- 
rem. It follows from Theorems 3.5 and 4.2. Again (cf. the remark right 
after the statement of Corollary 3.6) we do not know whether there is a 
proof of Corollary 4.3 whieh does not go through PD. 

Corollary 4.3. PFA implies OCA*(X) for every projeetive X C M. 

Question 5. Suppose that OCA*(X) holds for every projeetive X C M. 
Is then every projeetive set of reals Lebesgue measurable? 

5 BP FA and projective sets. 



We shall from now on be eoneerned with bounded foreing axioms, i.e., 
with BMA(F) for elasses of posets F. It turns out that BMA(F) is mueh 
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weaker than MA(r) for any reasonable elass T of posets whieh is signif- 
ieantly larger than c.c.c. 

The next lemma says that BM A(T) ean be phrased in terms of generie 
absoluteness. The proof ean be found in [BagOO]. 

Lemma 5.1. Let P be a poset. Then BMA(P) holds iff for every Si for- 
mula , Vn) and for all Ti, . . . , G 

^ \= (f{xi,. . . ,Xn). 

The bounded proper foreing axiom, abbreviated by BP FA, was intro- 
dueed by Goldstern and Shelah in [GoliShe95] as BMA(proper). This 
seetion will produee a proof of Theorem 1.2. 

We say that a regular eardinal n is Si refieeting iff for all a G and 
for every formula ), if there is some regular ^ > k with |= <h(a) 
then there is some regular § < k with |= $(a) {cf. [GoliShe95, 
Definition 2.2]). It ean easily be verified that every Si refieeting eardinal 
is inaeeessible and that if A is a Mahlo eardinal then the set of all k < A 
whieh are Si refieeting in Va is stationary in A. 

Theorem 5.2. If BPFA holds then M 2 is a Si refieeting eardinal in L. 
On the other hand, if k is a Si refieeting eardinal then there is a poset 
P G proper sueh that in V’'', BPFA holds. [GoliShe95] 

We say that a eardinal k is remarkable iff for all regular eardinals 
^ > K there are tt, M, R, a, N, and d sueh that the following hold: 
7T : M — is an elementary embedding, M is eountable and transitive, 
7t(k) = k, a \ M — is an elementary embedding with eritieal point R, 
N is eountable and transitive, d = M D OR is a regular eardinal in N, 
a{R) > d, and M = H^, i.e., M G N and N |= “M is the set of all sets 
whieh are hereditarily smaller than i)” (cf. [Seh200a, Definition 0.4]). 

We are now ready to prove Theorem 1.2. Let us eommenee with prov- 
ing Con(2) ^ Con(l). By Theorems 2.3 and 5.2 it will be enough to ver- 
ify that if BPFA holds and Mi is inaeeessible to the reals then in faet Mi 
is remarkable in L. For this in turn, via Lemma 5.1, it suffiees to prove 
the following. 

Lemma 5.3. Suppose that Mi is not remarkable in L. There is then a 
poset T G proper sueh that in V^, there is a real x with uji = 
[Seh201] 
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Proof. There is an cu-elosed poset P such that in V'*', = L «2 [^] 

for some A oji. This part of the argument just uses the non-existence 
of 0^. We may for instance let P = Col(<5^, 2^) a Co1(o;i, 5) a P where 
(5 is a singular cardinal of uncountable cofinality with 5^° = 5 and P is 
a forcing which codes some D C <5+ of the intermediate model, where 
L 5 +[D] = H 5 +, hy A <Z uji using <5+ many pairwise almost-disjoint 
subsets of 5 provided by Jensen’s Covering Lemma for L. 

Now fix such P and A G V’''. In V^, we may define a poset Q by 
setting p G Q iff there is an ordinal a < liJi such thatp: a {0, 1}, and 
for all /9 < a we have that 

L[t 4 n /9,p|'/9] 1= /9 is countable. 

It can be verified that the fact that Ki is not remarkable in L implies that 
V’*' 1= Q G proper. Therefore, P a Q G proper. 

Forcing with P a Q adds some B C ui such that 

1j[B A P] \= P is countable 

for every P < oji. Inside V’'”*'®, we may therefore define a sequence 
(oj : i < cui) of pairwise almost-disjoint subsets of uj such that for every 
i < coi, tti = the L[i? n f] -least subset of u which is almost-disjoint 
from every Uj, j < i. We may then define a poset § G by setting 
P = {^{p)a{p)) e § iff i{p) : n — {0, 1} for some n < u and r(p) 
is a finite subset of Ui. We let p = (^(p),r(p)) <g g = ii{q),r{q)) iff 
i{p) D i{q), r{p) D r(g), and for all i G r(g), if f G i? then 

{m G dom(f(p)) \ dom(f(g)) : i{p){m) = 1} fl a* = 0. 

The generic object will give the characteristic function of some subset a 
of u such that f G iJ iff a fl a* is finite. We shall have that V’''*'® |= § G 
C.C.C., so that T :=PaQa§g proper. 

However, in a will be such that H^2 = L^2 [a]. q-e.d. 

Let us now turn towards proving Con(l) ^ Con(2) of Theorem 1.2. 
We plan on using Corollary 2.8 for T = proper. Fix k < \ such that 
K is remarkable and A is Si reflecting. It is easy to verify that A is still 
Si reflecting in Let <p denote the statement that there is a Si 

reflecting cardinal. We then have (b) and (c) of Corollary 2.8, the latter 
one by the second part of Theorem 5.2 (more precisely, by the proof of 
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[GoliShe95, Theorem 4.1]). It remains to verify (a) of Corollary 2.8; we 
shall state this as a lemma. 



Lemma 5.4. Let k be a remarkable eardinal. Let G be Col(o;,< n)- 
generie over V. Let P G V[G] be a proper poset, and let H be P-generie 
over V[G]. Then for every real x in \[G][H] there is a poset G 
sueh that x is Q^^-generie over V. [Seh201] 



Proof. Let i) > k be a large enough regular eardinal; in partieular, we 
want that ^(P) C H^[G], Le., that the power set of P be eontained in 
H^[G]. Let X G M n V[G][/7], and let x G H^[G] be sueh that x^ = 
X. The faet that k is remarkable in V implies that in V[G], there is a 
stationary set S of eountable subsets of [G] sueh that for eaeh X G S 
we have the following: there are tt, a, and d sueh that 



7T 



(H 



v[GnHy] 



;G,H|,GnH 



Va 






is an elementary embedding with X = ran(7r), a is the eritieal point of 
7T, and a and d are regular eardinals of V (and henee of V[G fl H^]) 
(this aetually eharaeterizes remarkability; cf. [Seh201, Lemma 1.6]). As 
P is proper, S is still stationary in V[G] [H], 

Now eonsider the strueture 



m=iU4G];e,F,x,H). 



Beeause S is stationary in V[G] [if], we may piek some X G S sueh that 
X is the universe of an elementary submodel of 9K. Let 



it: 






be the inverse of the Mostowski eollapse of X. By the elementarity of 
TT, H is P-generie over and henee over all of V[G fl H^j, 

as ^(P) n V[G n C Moreover, by the definability of 

— — — 

foreing, we get that n E x iff3pEHp\\-hExiff3pEHp\\-nEx 
iff n G iff n G T. So T = x, and we may set Qa; = Col(c<;,< a) * P 

where P = P. Notiee finally that Qa; G V^. 



q.e.d. 
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6 BMM. 

This final section will be concerned with another bounded forcing axiom. 
Bounded Martin’s maximum, abbreviated by BMM, was introduced in 
[GoliShe95] as the statement that BMA(stap) holds. There is a lengthy 
discussion of BMM in [Woo99, §10.3]. 

Lemma 6.1. Suppose that does not exist for some set X. Then there 
is a poset T G stap such that in V^, there is a real x with cui = 
[SchsOO] 

Proof. The point is that if X* does not exist for some set X then we 
can still define (a version of) the forcing T from the proof of Lemma 5.3; 
moreover, T will be stationary preserving. q.e.d. 

As a corollary to Lemma 6.1 do we get that BMM is much stronger 
than BPFA in the presence of Lebesgue measurability of projective sets. 

Corollary 6.2. Suppose that BMM holds. If cui is inaccessible to the re- 
als then V is closed under sharps, i.e., X* exists for every set X. In 
particular, if BMM holds and every projective set of reals is Lebesgue 
measurable then V is closed under sharps. 

This corollary can be improved to get the following better lower 
bound on BMM + “every projective set of reals is Lebesgue measurable.” 

Theorem 6.3. If BMM holds and every projective set of reals is 
Lebesgue measurable then there is an inner model with a strong cardi- 
nal. [Sch 200 ] 

As an upper bound, we only have: 

Theorem 6.4. Suppose that ZFC -F “there is a proper class of Woodin 
cardinals” is consistent. Then so is ZFC -F BMM + “every projective set 
of reals is Lebesgue measurable.” [Woo99] 

This leads to the obvious question: what is the consistency strength of 
ZFC -F BMM -F “every projective set of reals is Lebesgue measurable”? 
Let me finish with a conjecture which consists of two parts. 

Conjecture 6.5. 1. If BMM holds and every projective set of reals is 
Lebesgue measurable then there is an inner model with infinitely 
many strong cardinals. 

2. If BMM holds and every set of reals in L(M) is Lebesgue measurable 
then the axiom of determinacy holds in L(M). 
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sets of reals as oracles. This higher type theory has not as yet received 
the same attention as the theory on integers. The aim of this article is 
to provide some commentary on what we feel this higher type theory 
should look like, and to ask a number of questions whose answers should 
help develop that theory, and place it in a context with other theories of 
reducibilities at this level. 

In this section we give some of the basic definitions \ and mention an 
alternative machine architecture that yields the same class of computable 
functions on single tape machines. In the next section we state the cor- 
responding definitions and facts for the higher type theory. We give no 
proofs of these latter results. Often they are simply the analogous ones 
for the theory on integers, and the reader can construct them for his or 
herself. 

We refer the reader to [HamLewOO] or to Joel Hamkins’ article in this 
volume, for the basic structure, definitions, and results concerning such 
machines. We let Pn denote the nth program in some fixed enumeration. 
In Section 2 we shall assume some familiarity with the notions of deter- 
minacy of two person perfect information games — see, e.g., [Kano94]. 
For a pointclass T, “Det(T)” will denote the assertion that games with 
payoff sets A G T have winning strategies for one of the players. 

It was noted in [HamSeaOI] that the machine architecture could be 
replaced by a single tape machine using 0 and 1 as an alphabet, but only 
as long as one considered computable functions / ; N — > N. For func- 
tions F : M — M it turned out that a single tape is insufficient. 

We first note here that a not unattractive one (or three) tape machine 
model is obtained which computes the same class of functions (on either 
integers or reals) as the original machine architecture, by allowing into 
the alphabet blank cells on the tape^. The purpose of a “blank” or empty 
cell is to signify ambiguity. We adopt a new limit rule for specifying the 
contents of cells at limit stages: if a cell’s value has varied cofinally often 
below A, we set the value to the “ambiguous” value of a blank. 

Ambiguity limit rule. If A is a limit ordinal, then the contents of 

the ith cell on the tape at time A, Ci(A), is given by: 

* For more details, cf. [HamLewOO]. 

^ To be sure one now has a machine that in essence works on 3“ instead of 2“, although for 
the theorem above we still consider a: € R input as an infinite sequence from {0, 1} only. A 
function F : R ^ R is computable by such a machine, if for all real input strings, the output 
strings are also real strings, i.e., without blanks. 
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If there is uq < A sueh that for all u < A(z/q < u implies Cj(uo) = 

Cj(z/)) then set Cj(A) = Cj(uo); otherwise set Cj(A) to be a blank. 

One then has: 

Theorem 1.1. Let C be the elass of funetions F : M — M eomputable 
by the Hamkins-Lewis maehines of [HamLewOO], and C' those of the 
one-tape maehine just speeified. Then C = C' . 

It is not hard to see that if we now define a one-tape maehine using 
this ambiguity limit rule, then its aetion ean be simulated on a {0, 1}- 
valued three tape [HamLewOO] -maehine. It was noted in [HamSeaOl] 
that the single tape maehines with {0, 1} values only eomputed a elass 
of funetions that was not elosed under eomposition. This had the effeet 
that the natural simulation of a three-tape maehine by a one-tape ma- 
ehine eould not realise that it had performed a final eompression of the 
simulated output^. 

We have here an extra symbol -namely the blank- in the alphabet, 
and this gives us enough room to set flags and enable us to elose up un- 
der eomposition, and thus ealeulate the same funetions as the three-tape 
model. (For readers familiar with [HamSeaOl] this is really an exereise.) 
However, the theorem above notwithstanding, we shall stiek to the def- 
initions and eonventions of [HamLewOO] for this paper. The rest of the 
definitions of this subseetion are all taken from that paper. It should be 
noted that we deliberately elide the distinetion between subsets of a; and 
their eharaeteristie funetions as elements of 2“^ and write “n G g" inter- 
ehangeably for “(y'(n) = 1” ete. 

Definition 1.2. For x G 2^^, ^ni^x) denotes the result of running Pn on 
input string x. 

For any n, x it is easily seen that there is a eountable ordinal a so that 
the behaviour of (pn{x) has “settled” by stage a: either it has halted or it 
has entered a permanent looping eyele: notationally, either (pn{x)iy for 
some y G 2“, or else (pp{x)'\. For a maehine with an oraele / G 2^^ we 

^ A compression is a map of the form {xoyozoxiyizi . . .) {zqZiZ 2 ■ ■ •); although clearly 
one-tape computable on its own, one cannot compose it in a one-tape computable fashion with 
the other steps of the simulation. One needs a flag or some device such as further scratch pad 
(cf. [HamSeaOl] Theorem 2.3.) that is not part of the final output to allow us to know when 
the final compression is complete, and the one-tape computation may halt. 
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allow the machine to receive 0/1 answers to queries of the form “Does 
f{n) = 1”? We write for “program p with input x and oracle 

/ eventually has a settled y on its output tape”; in other words, there 
is a time v, so that for all later times ‘pl{x) has y sitting on its output 
tape — although without requiring that Lp*p{x)[. The relations 

and “ 92 ^ are A 2 relations of /, x, {y). We let wo denote 
the set of reals in 2 “^ that code wellorderings of u. 

Definition 1.3. We let X G yV if and only if there is a p G cu such that 
(fp{0)lx; and we say that W is the class of writable reals. 

Fact 1.4. The writable ordinals, (those ordinals a so that there is a; G 
W n WO with 1 1 a; 1 1 = a) form an initial segment of the countable ordi- 
nals. [HamLewOO, Theorems 3.7 & 8.3] 

Definition 1.5. Let A ;= sup{||a;|| : x G woflW}, and 7 := sup{o; : 
there is some p G cu such that <Pp(0)| and halts in exactly a steps}. 

Definition 1.6. We define SW := { X G 2‘^ : there is some p such that 
<Pp(0)}a;}, and call the set of eventually writable reals. Moreover, 
C := sup{||a;|| : x G WOflf^W). 

Definition 1.7. We define the set of accidentally writable reals as fol- 
lows: AW := {a; G 2*^ ; there is some p such that x appears on any 
tape at any time of the computation of <Pp(0) }. Let S := sup{||t|| : t G 
AW n wo}. 

The accidentally writable reals are thus likely to be transient. (An 
equivalent class of reals is obtained by restricting the appearance of such 
reals to the output tape alone.) Clearly AW ^ £W 3 W. We let HC 
denote the class of hereditarily countable sets. 

Definition 1.8. We write H(A) (H((/), H(S) respectively) for the class 
of sets y G HC so that 3p G W (£W, AW respectively) with y coding y. 

We use the notation W-^, \^ ... , (A-^), . . . for the notions relativi- 

sed to a real /. 
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Definition 1.9 (The weak jump). For / G 2“^ , define f'^ to be the set 

{neu : 

It is easy to see that / — *> /^ is a A 2 function. The notions of g is a 
decidable (or semi-decidable) set of integers is the natural one: g must 
have a totally (or partially) computable characteristic function: 

Definition 1.10. (7 G 2“ is semi-decidable in the real f if for some e we 
have for all n G a; 



n e gif and only if 

As usual g is decidable (in /) if both it and its complement are semi- 
decidable (in /). 

For the ordinary notion of Turing computability one has the equiva- 
lence between the semi-decidable, i.e., the recursively enumerable, sets 
of integers as those that are the domain of some computable function, 
with, as alternative, those that are the range of such. One can establish 
this by observing that from the universality of Kleene’s T-predicate one 
may simply test integers to see if they code a whole course of compu- 
tation of the relevant function, and seeing if they terminate in a desired 
number output. The point is that the course of computation can be coded 
as something within the domain of the Turing machine. But is that the 
case for infinite time Turing machines? Or do we have in the phraseology 
of Sacks [SacSO], a “violation of parity” (between the type of the object 
in the domain of the computation, and that of the computation itself)? 
In essence we are asking whether (a code for) a halting computation can 
also be the result of a halting output of another computation. For this it is 
enough to have a real code for the length of any halting computation to 
be a potential output of some halting computation. Once we have such a 
code to hand we can then write a program simulating the original course 
of computation along that ordinal, and so output a code for the course of 
that computation. In the terms we have defined, we are asking whether 
7 < A. This had been the principal open problem left from [HamLewOO]. 
The answer is affirmative. 

Theorem 1.11. The ordinals A and 7 are the same, and by relativisation 
we also have for all / G 2*^ that = 7 ^. [WeloOOb, Theorem 1.1] 

Compare Hamkins’s definition on page 154 of this volume. 
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Once we have this result we ean answer a number of questions, as to 
what the deeidable sets of integers are, what is eomplexity of the weak 
jump operator, what are the sets H(A) ete. 

Theorem 1.12. For i9 G {A, C, S} the models H(i9) and L,? are the same, 
and similarly for any / G 2^^ and G {A'^, S-^}, Hf(i9) = L^[/]. 

[WeloOOb, Corollaries 3.1, 3.3, 3.5] 

Corollary 1.13. The deeidable sets of integers are preeisely those of 

p{u) n La. 

The relationship between the various H(i9) sets is given by: 

Theorem 1.14 (The (A, (, S) Theorem^). 

1- La -<Si L^ Ls. 

2. (A, C, S) is the lexieographieally least triple of ordinals satisfying 
this relation. 

This theorem implies that we may eonsider the definable sets of inte- 
gers as those Si -definable inside the least model of KP with a transitive 
S 2 -end extension. It further implies that La, L^ are admissible sets sat- 
isfying strong refleetion properties. For example, there are unboundedly 
many a below any of the ordinals A, C, S where Lq |= S 2 -KP (mean- 
ing Kripke-Platek set theory with S 2 -Colleetion and A 2 -Comprehension 
axioms). 

By appealing to Theorem 1.11 as 7 -^ = A-^, Theorem 1.12, and by 
eonsidering the running of sueh maehines inside La/ [/] one sees: 

Corollary 1.15. The set is an element of Si( La/[/]). 

As is not an /-deeidable set of integers, it eannot lie inside La/[/]- 
It is thus perhaps unsurprisingly a Si-mastereode for La. 

Corollary 1.16. The set f'^ is (ordinary) Turing isomorphie to the set 
Si-Th(LA), i.e., the eomplete Si-theory of La. [WeloOOa] 

There is the natural reducibility relation assoeiated to reals: 



^ Cf. [WeloOOa], 
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Definition 1.17. For x,y E 2^^ we write x <oo y if there is an n such 
that for all k we have 

(pl^{k)il iff x{k) = 1 & v2^(/c)|0 iff x{k) = 0 . 

Corollary 1.15 enables one to see that the assignment / i— > A-^ satis- 
fies the usual Spector Criterion: 

Lemma 1.18. The statement x <oo y is equivalent to a; G \j\y[y\, thus 
the statement x <oo y implies that A^ < A^; and the assignment a; i— > A® 
then satisfies a Spector criterion: 

X <oo y implies {x'^ <oo y A"" < A^). 

The analogy of <oo with ordinary Turing reducibility is explored in 
[HamLewOO]. We take Lemma 1.18 as indicating that the proper analogy 
here is rather with something intermediate between hyperdegrees and 
that of A^-degrees^. 

The principal open question we can formulate at this point is the fol- 
lowing. 

Question 6. Let D be a countable set of infinite time degrees. Does D 
have a minimal upper bound? 

This question remains unresolved for hyperdegrees. For A 2 -degrees 
there are two differing but complete pictures, both with affirmative an- 
swers, depending on whether M C L or not (cf. [Frio74]). There are some 
partial results on Question 1 in [Welo99], but they are very partial, and 
the problem looks difficult. 

1.1 Eventually infinite time degrees 

There is one sense in which one could argue that the ordinal ( is perhaps 
more fundamental than A. It is after all the point by which the behaviour 



® With the assignments x u!i (where wf is the least ordinal not recursive in x) and x 
A 2 (a:), respectively. 

^ [Welo99] uses some unexplained notation which we here clarify: F{e, f) = (3 abbreviates 
’Ve(0)i in exactly /3 steps” 
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of the machine on a computation of the form (pe{k) is determined: either 
it has halted, or it has begun to loop permanently at some ordinal C < C-* 
We could thus define a reducibility relation “x is eventually writable 
from y” and a corresponding jump: 

Definition 1.19. We write X <eoo y if a; G £W^, and x for the set of 
indices {p E u : there is some y G such that <^p(0)ti/}. 

We have parallel to the above: 

Lemma 1.20. The statement x <eoo y is equivalent to a; G Li^y [y] ; thus 
the statement x <eoo y implies and the assignment a; h- > then 

satisfies the Spector criterion: 

X <eoo y implies {x <eoo y C < O- 

Lemma 1.21. The real x is Turing isomorphic to the complete 
S 2 (L^rc [x]) set of integers. 

Again the variant of Question 1 arises. 

Question 7. Let D be a countable set of eventually infinite time degrees. 
Does D have a <eoo minimal upper bound? 

Let F = { X E 2‘^ : a; G L^a:}. We note that it makes no difference 
whether we write or in this definition, since there is a program that 
given an x as input, searches for its L-rank. Once found we can ensure 
the program halts. Hence if a; G F then x E There is an analogy 
here with Ci, the largest thin Ilj set of quickly constructible reals. ^ 

We may define a natural hierarchy of eventually infinite degrees 
through F in the spirit of [Kec75] (where this is done for Q and hy- 
perdegrees) by oo := [0]eoo, e„+i := [e«]eoo, := lub{e« : a < y} for 
limit ordinals y if defined. (Here “lub” abbreviates “least upper bound” 
in the degree ordering.) 

Question 8. What is the natural length of this hierarchy? That is, what 
is the least g so that is undefined^®? 

* Moreover there is e so that this upper bound is attained here. It would take us too far afield 
here, but it is essentially the pattern of the universal machine ifiu’s output tape at stage that 
is recursively isomorphic to certain sets of interest occurring in the revision theory of truth 
based on Herzberger sequences. 

^ Here, Ci := {a: : a; € Cf. [Kec75] or [MosiSO, 4F.4] for the basic properties and 

structure of Ci. 

The version of this question for the infinite time degrees has a known answer: the natural 
hierarchy is undefined even at the tuth stage. 
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2 Higher Type Computation 

We briefly review again the relevant deflnitions from [HamLewOO]: For 
A C 2“ we write (Pp{x) for the function deflned by the pth. program in a 
recursive listing of programs allowing queries of the set A, as to whether 
a real currently on the scratch tape is, or is not, in A (and receiving a 
simple 0/1 answer), using real input x. 

Definition 2.1 (The strong jump). We define 0^ to be the set {(n, x) E 
cu X 2“ : Lpn{x)i}. For A C 2“^ define A'^ := {{n,x) E to x 2"^ : 

There is an associated reducibility relation associated to sets of reals: 

Definition 2.2. We write A <oo B if there is an n such that for all x E 2‘-^ 
we have 

(a;)J,l if and only if x E A, and 

(Pn if and only iix ^ A. 

We make specifically semi-decidability into a definition: 

Definition 2.3. A set of reals A is semi-decidable in a set of reals B if 
and only if: 

3nVa; G 2^^ [ip^ if and only if x E A\ . 

A set of reals A is semi-decidable in a set of reals B and a real y E 2"^ 
if and only if: 

3nVa; G 2‘^ \p^'y{x)[l if and only if a; G A] . 

We want a notion of “semi-decidability” that results in a pointclass 
of sets closed under continuous preimages. Thus (when sufficient deter- 
minacy) it shall have a Wadge rank, which in turn gives a measure of 
complexity to the notion of semi-decidability we have defined'^. 

The notion “A is decidable in B” ( respectively “in a real and B ”) 
is as usual: this relation holds if both A and -lA are semi-decidable in B 
(respectively in a real and B). 

Cf. [HamLewOO]; compare also Hamkins’s definition on page 154 of this volume. 

Wadge defined the following ordering on sets of reals: A <w B if there is a continuous 
reduction of A to B: i.e., there is a continuous / : 2“ ^ 2" so that x G A if and only 
if f{x) e B. Then <w is easily seen to be reflexive and transitive (cf. Section 3 of the 
introduction to this volume; p. 6). This will give us a notion of Wadge rank of the sets of 
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Definition 2.4. A <r^oo B denotes that A is decidable in B and a real. 

For the rest of this section “semi-decidability” will abbreviate “semi- 
decidability in a real” — the boldface notion. 

Definition 2.5. Define Fq ;= {A C 2“ : A is semi-decidable from a 
real} and Aq ;= Fq fi -iFq. 

Theorem 2.6 (Hamkins, Lewis). Aq C A 2 is a a-algebra, closed un- 
der Suslin’s operation A, properly containing the Selivanovski C-sets^^. 
[HamLewOO] 

It is easy to verify that the semi-decidable in 2 : sets of reals are prov- 
ably A} ( 2 :) and thus, by a result of Solovay (a proof is given in 
[Fen2Noro74]), are all absolutely measurable, and enjoy the property of 
Baire. 

There are two questions related to hierarchies with a classical flavour 
(as opposed to the descriptive set-theoretic viewpoint based on the later 
effective theory of Moschovakis and others). The first is to ask how the 
semi-decidable sets sit with relation to the Kolmogorov R-sets. 

Question 9. Classify the semi-decidable sets of reals within the hierar- 
chy of the R-sets. 

We conjecture that they sit very low down in this hierarchy. The 
reader may consult [Hin78, V.4 & V.5] for an account of classical hi- 
erarchies through A 2 . There is another notion of generalised computa- 
tion, due to Blackwell [Blao78], the '"Borel programmable functions ” — 
derived from the theory of dynamic programming in probability theory. 
For this, let X, Y be perfect polish spaces, and 2‘^ be Cantor space. A 
program is a function p : 2‘^ — 2“^ so that for all n e u, x{n) < p{x){n) 

reals in any given pointclass F provided we know that <w restricted to F is wellfounded. 
Assuming sufficient determinacy Wadge’s Lemma (c/[Mosi80, 7D.3]) asserts that <w so 
restricted is (virtually) a linear ordering, and a theorem of Martin (c/[Mosi80, 7D.14]) asserts 
-still assuming sufficient determinacy- that it is wellfounded. For F equal to the class of Borel 
sets no determinacy is required (indeed, unlike Borel Determinacy, these facts can be proven 
in second order arithmetic). Assuming the full axiom of determinacy, AD, then the Wadge 
hierarchy so obtained has rank 0 := sup {7 : there is some surjective p : R ^ 7 }. 

The Selivanovski C-sets are the smallest cr-algebra of sets containing the Borel sets, and closed 
under operation A. 
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(thus we are really thinking -vzacharacteristic functions- of p as operat- 
ing on a subset xofu and returning a possibly larger set p{x)). We iter- 
ate p in an obvious way, and let Pa be the ath iterate, with p\{x){n) = 
sup^<xPa(n) for all n at limits A. Let pn be the fixed point function 
attained at some < cui. A Borel function e : X — 2“^ is called an 
encoder, a Borel d : 2“^ — F is a decoder. Let c* be the constant function 
on uj with value i. 

Definition 2.7. A function f : X ^ Y between perfect polish spaces 
X, Y is Borel programmable if it is of the form d o p^ o e for some 
d, p, e of the above form. A set A C X is Borel programmable if its 
characteristic function qa '■ X ^ 2‘^ is (where Qa{x) = Ci if a; G A and 
qa{x) = Co if a; ^ A.) 

Question 10. Are the infinite time Turing machine decidable sets all 
Borel programmable sets? 

It is easy to see from the definition of Borel programmable func- 
tions, that their computation could be effected on an infinite time Tur- 
ing machine (equipped with oracles for the requisite Borel codes of the 
functions concerned). The question concerns the reverse inclusion. The 
question prima facie is not unreasonable, since by a result of Burgess and 
Lockhart [BurLoc83] the Borel programmable sets also properly include 
the C-sets. However we conjecture the answer is negative. 

In [HamLew02] a positive solution to Post’s problem (whether there 
can be A with 0 <oo A <oo 0^) is proposed. In fact two countable 
semi-decidable sets A, B C 2‘^, which could be construed as constructed 
definably over La, are seen to be <oo -incomparable viaa Friedberg- 
Muchnik type construction. Since the sets constructed are countable, they 
are obviously both of degree 0 when considering the boldface reducibility 
of decidability in reals. One can “remedy” that as follows 

Lemma 2.8 (V=L). The set F of fast reals satisfies: 0 <00 F <oo 0^. 
Moreover F is not decidable in a real, and 0^ is not decidable in any real 
and F; that is 0 <r^oo F <r,oo 0^. 

The set F is thus, in L, a (boldface) strictly intermediate set. That 
some kind of set theoretic hypothesis is necessary to build intermediate 
sets will be seen below. 

We are grateful to John Burgess for bringing this example to our attention — albeit in a rather 

different context. 




234 



PHILIP D. WELCH 



We take the view here that one tends not to eonsider the relation of 
“A is in B” between sets of reals (although of eourse this makes 
perfeet sense for sets of integers). We should like to generalise the no- 
tion of boldfaee semi-deeidability that is afforded by Kleene degrees (see 
[HrbSim280] for a diseussion of this notion). Briefly, for sets of reals 
A, B one writes A <k B (“A is Kleene reeursive in 5”) iff there is a real 
y so that the eharaeteristies funetion xa of A is reeursive (in the sense of 
Kleene [Kle59]) in y, xb and the existential integer quantifier The 
eomputational model here differs from the infinite time Turing maehines, 
but one ean view it as a maehine equipped with ability to quiz an oraele 
for B and y, with a eountably infinite memory, and an ability to seareh 
that memory in di finite amount of time. 

The Kleene reeursive sets are then the Borel sets, and the Kleene 
semi-reeursive sets are the eoanalytie sets. Solovay had shown {cf. 
[SolVl]) that under AD, the axiom of determinaey, the Kleene degrees 
are well-ordered. The eomplete semi-reeursive set is wo, the 11} set of 
reals eoding wellorders. The nature of the redueibility ordering <k de- 
pends on one’s universe of diseourse: eontrasting with Solovay’s AD re- 
sult already mentioned, in L, or in set generie extensions thereof, there 
are 2^° <K-ineomparable semi-reeursive sets below the degree of wo 
{cf. [HrbSim280]). A sharper result than Solovay’s (no pun intended) 
is the faet that if II}-determinaey holds then all Kleene semi-reeursive, 
non-reeursive sets have the same Kleene degree. This is a result of Steel 
[Steo80]. The eonverse also holds by Harrington [Haro78, Theorem 4.4]. 
We should expeet an entirely analogous pieture to obtain for the degrees 
of the semi-deeidable sets defined above. 

In the following we use freely the natural generalisations of the def- 
initions and notations from the first seetion. We thus, for example, write 
“A^” for the first ordinal that is not (eoded by) the output of any halting 
eomputation with oraele B, on input 0. 

It follows from a version of the (A, C, S) Theorem 1.14 that the sets 
of reals semi-deeidable in a; G 2“^ are those sets A expressible with a Si 
formula foi'^o, as follows: 



y e Aif and only if [x, y] \= fo{x, y). 
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The relation A is semi-decidable in B (A, B C 2‘^) and x & 2‘^ can be 
seen to be given similarly as: 

y e Aif and only if [x, y, B] \= ^/Jo{x, y, B). 

In the latter equivalence, we should have precisely the notion of A 
being Kleene semi-recursive to B, if one simply replaced by 

simply mimicking a (version equivalent to) 
Kleene’s definition here. Suppose we are given any function f :V ^ On 
of ordinary Turing degrees to countable ordinals that is definable via a Si 
formula ni) so that for any (ordinary) Turing degree [i/]r we have 
/(Mt) = 01 iff L[i/] 1= '^(y, a); then we may define a slice through A 2 
by defining a lightface pointclass Tq as follows: A G Tq if and only if for 
some formula -() (no, we have a; G A if and only if L /(j.) [x] |= -()(a;).^^ 
How high a rank / has in modulo the Martin measure {cf. [Kauo94, 
p. 386]), assuming say Det(A 2 ), determines the complexity of the class 
Tq. By some measures A"^ is a large ordinal, and we are dealing with a 
complex class. 

We may then, to be specific, ask questions about the semi-decidable 
sets below 0^ as follows: 

1. Is there A C 2“^, A G Fq, A not decidable in a real, but satisfying that 

0^ is not decidable in a real and A? 

2. Are there A, i? G Fq neither of which is decidable in a real from the 

other? 

The outcome, just as for Kleene degrees, depends on the set theoreti- 
cal universe one inhabits. 

Theorem 2.9. In L, the constructible universe, (or in set generic exten- 
sions thereof) there are A, B as in 2. 

We have not checked, but fully expect that, there are 2^° many such 
incomparable sets A. We list a number of problems, which are supposed 
to elucidate the boldface reducibility ordering in the constructible uni- 
verse. We expect their answer (and solution) to be similar to that for 
Kleene degrees. 

A boldface definition would add in a real parameter. 
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Question 11. Assume V = L. 

1. Can there be a semi-deeidable set A of minimal <r_oo"degree? 

2. Can there be infinite deseending sequenees of <r,oo-degrees? 

3. Are the <r,oo-degrees of semi-deeidable sets dense between 0 and 0^? 

(We eonjeeture the answer to 3. to be affirmative, thus solving the other 
two parts.) In the following we let for F (any pointelass) Boolean(F) 
denote the pointelass obtained from the sets of F and elosing up under 
eomplementation and finite unions. 

Theorem 2.10. Det(Boolean(Fo)) implies that every A G Fo\Aq satis- 
fies: 0^ and A are mutually deeidable in a real. 

Thus suffieient determinaey ensures the answers to both questions is 
negative. In essenee all one has to do for this latter theorem is to quote 
a result of Steel (cf. [SteoSO]) that for any boldfaee pointelass F the de- 
terminaey of sets in Boolean(F) ensures that any set not F-self-dual is 
eontinuously redueible to another sueh set. 

There remains the question of how to measure the eomplexity of Fq. 
One method is to aseertain how mueh det4erminaey Det(Boolean(Fo)) 
actually is. Can one find an equivalence in terms of inner models of large 
cardinal axioms? We formulate this as follows: 

Question 12. Determine the strength of Det(Boolean(Fo)). Can the 
sharp of an inner model of some large cardinal axiom be found which 
is equivalent to this? 

In general the direction from a sharp to the determinaey is difficult. 
For the theory of Kleene degrees we have already remarked that there 
is a precise answer: Det(Boolean(IIJ)) yields, by the above comments, 
that all non-Borel co-analytic sets are Kleene mutually reducible to the 
complete Ilj set. This conclusion had been obtained by Steel assuming 
just Det(nj).^^ 

We can appeal to methods of Steel [Steo82b], using Friedman- style 
games to show that Det(Fo) implies reasonable large cardinal strength. 
We first have: 



Of course Harrington showed that Det(Ilj) implied the existence of for any real x, and 
this was known by work of Martin to imply Det(Boolean(Ilj)). 
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Theorem 2.11. The following is a (lightface) decidable relation 

“(x, y) G 2‘^ codes countable premice M,N <* O' and M N 
in the (pre)mouse ordering.” 

In the above the relations and <* are the canonical (pre)mouse order- 
ings:^^ For premice M <* iV is to be interpreted as “In the comparison 
coiteration of M with N to models then either (i) ■§ is least so 

that one of the models is illfounded, and that model is or (ii) is 
an initial segment of The notation 0^ (read as “zero pistol’") denotes 
the sharp for an inner model of a strong cardinal (if it exists). Using the 
fact that Det(ro) implies Det(Il}) we may use the Sg-correctness of the 
core model built assuming there is no inner model with a strong cardinal 
(c/[SteoWelo94]) and obtain: 

Theorem 2.12. Det(To) implies that 0^ exists. 

In fact one has for any real x. But this seems far from an optimal 
result. Indeed to ask a question as to lower bounds: 

Question 13. Does the mutual decidability in reals of all A, B E To\Ao 
imply exists for all a; G M? Does it imply x* exists? 

We expect the answers should be affirmative. 



17 



Cf. [Zem02, § 5.4]. 
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